May 06

AECData Python Library Step 3— Visualizations & Statistical Analysis

We go step by step and explain how to start with the filtering and retrieval of data on aecdata (see code below).

Visualizing Data - Plotting Graphs and Deriving Statistics

Done with data retrieval, on with the stats!

Welcome to the third tutorial on using the open-source AECdata library provided by 2050 Materials.

In this tutorial, we’ll learn how to plot visualizations and derive statistics from your data. This guide will cover grouping data by category and location, removing outliers, and calculating median values and quartiles. Plus, we’ll show how to create a distribution plot.

Setting Up Your Environment

Before diving into the statistics and plots, ensure you’ve imported the necessary classes from the aecdata library:

from aecdata import ProductData, ProductStatistics import pandas as pd

Initializing the ProductStatistics Class

Start by creating an instance of the ProductStatistics class. This class extends the functionalities of the ProductData class, allowing for advanced data analysis.

Grouping and Filtering Data

One of the powerful features of the ProductStatistics class is its ability to group and filter data efficiently. Here’s how you can do it:

Data Table - Analyzing and Organizing Data

This code groups the data by country and material type, which is particularly useful for regional analysis and comparisons between different materials.

Outliers can skew the results of your data analysis. The ProductStatistics class includes methods to remove these effectively. Notice above in the get_statistics() function the parameter remove_outliers=True

Plotting Data Distributions

Visualizations can help understand the distribution of data. Let’s plot a histogram and a boxplot:

Distribution of Material Facts in Manufacturing

Distribution of Material Facts Manufacturing by Product Type

These plots will provide visual insights into the distribution and variance of the impact factors across different material types.

Done, for now!

You’re now set up with aecdata and have used theProductStatistics class, to perform detailed statistical analysis and visualizations.

This tutorial covered grouping data, removing outliers, and visualizing distributions, which are crucial for making informed decisions based on your data.

Stay tuned for our next tutorial, where we’ll go over how to implement aecdata within a data-science environment!

Stay tuned, and happy coding!

This library is provided by 2050 Materials, a company dedicated to unlocking the value of data in the construction industry to enable the climate transition.

If you are interested in embedding this data within your workflows, or have a specific problem, reach out to us at api@2050-materials.com

Previous AECData Python Library Step 2— Get filters and retrieve product data
Next LCA tools in the built environment — VERT WLC Tool, powered by 2050 Materials

Related articles

A variety of sustainable flooring materials showcased on the 2050 Materials platform, highlighting eco-friendly and low-carbon options.
Data & Research Dec 17

A Guide to Flooring Products with Environmental Certificates

This article highlights some of the most innovative and flooring products available today, exploring their material composition, environmental performance, and versatile use cases.

Read more
Sustainable paints collection on 2050 Materials
Data & Research Dec 12

A Guide to Paints with Sustainable Properties in 2024

This article highlights some of the most interesting green paint products available today, showcasing their unique features, sustainability credentials, and practical applications.

Read more
EcoSheetPile™ steel sheet piles displayed on the 2050 Materials platform for sustainable construction.
Data & Research Nov 28

Aluminum and Steel Products for Better Environmental Impact

This article explores innovative advancements in aluminum and steel products designed to mitigate environmental impacts.

Read more