What LEGO Data Reveals About the Companyโ€™s Growth

Exploring 70 Years of LEGO History Using Python and Data Analysis

Posted by Philip Aron Barlaan on March 05, 2026

๐Ÿงฑ A Simple Question That Turned Into a Data Project

LEGO has been around for more than 70 years, but the company we see today looks very different from the one that existed decades ago.

As someone who enjoys both LEGO and programming, I became curious about what the data could tell us about how the company evolved over time.

Instead of just looking at individual sets, I wanted to answer a few bigger questions:

  • Has LEGO been releasing more sets over time?
  • Did the number of themes grow as the company expanded?
  • Are modern LEGO sets becoming more complex?

To explore these questions, I analyzed a large LEGO dataset using Python.

๐Ÿ“Š The Dataset

The dataset comes from Rebrickable, a website that maintains one of the most detailed LEGO databases available.

It contains information about thousands of LEGO sets, themes, colors, and parts.

For this project I mainly worked with three datasets:

  • sets.csv โ€“ information about every LEGO set
  • themes.csv โ€“ the category or theme each set belongs to
  • colors.csv โ€“ color data for LEGO pieces

One interesting challenge was combining the datasets together. The sets dataset only stored a theme ID, so I had to merge it with the themes dataset to get the actual theme names.

This is a simple but important example of working with relational data in Python.

๐Ÿ“ˆ LEGO Set Releases Over Time

One of the first things I looked at was how many LEGO sets were released each year.

The trend was very clear: LEGOโ€™s production has grown dramatically over time.

In the early decades, only a small number of sets were released each year. But starting around the 1990s and especially in the 2000s, the number of sets increased significantly.

This reflects how LEGO expanded its product catalog and started targeting a wider audience.

๐ŸŒ The Rise of LEGO Themes

Another interesting trend was the growth of LEGO themes.

Instead of selling only generic building bricks, LEGO began creating themed experiences such as:

  • Space
  • City
  • Pirates
  • Licensed franchises like Star Wars and Harry Potter

The data shows that as LEGO released more sets, it also introduced more themes. This allowed the company to reach different audiences and keep the brand fresh and exciting.

๐Ÿง  Are LEGO Sets Getting More Complex?

One question I found especially interesting was whether modern LEGO sets are becoming more complex.

To explore this, I looked at the number of parts included in sets over time.

The results showed that part counts have generally increased over the years.

This suggests that LEGO sets have become more detailed and sophisticated, which aligns with the growing popularity of collector sets and adult-focused LEGO builds.

Today, many LEGO sets are designed not just as toys but also as display pieces.

๐Ÿ’ก What This Analysis Suggests

Looking at the data overall, a few patterns become clear:

  1. LEGO releases far more sets today than in the early decades.
  2. The number of themes has expanded significantly.
  3. Modern sets tend to be larger and more complex.

Together, these trends show how LEGO evolved from a simple toy manufacturer into a global entertainment brand.

๐Ÿš€ Final Thoughts

This project was a fun way to combine programming, data analysis, and curiosity about a brand many of us grew up with.

Using Python libraries like Pandas and Matplotlib, I was able to explore patterns in the data and turn them into visual insights.

If you're interested in the full analysis and the code behind this project, you can check it out here:

๐Ÿ‘‰ View the full project on GitHub

Projects like this remind me that data can tell interesting stories โ€” sometimes even about something as simple as LEGO bricks.