Επεξεργασία

Κοινή χρήση μέσω


Measure Azure app sustainability by using the SCI score

Azure Monitor
Azure Automation
Azure Logic Apps
Azure Table Storage
Power BI

The solution described in this article can help you create a sustainability model for applications that are hosted on Azure. The model uses proxies that, over time, allow you to score an application's carbon impact and efficiency. The score is known as the Software Carbon Intensity (SCI) score. It provides a baseline for measuring changes in an application's carbon output.

Architecture

Diagram of a sustainability model that scores the carbon impact of an application.

Download a Visio file of this architecture.

Dataflow

  1. Configure application data sources that you'll use to calculate your SCI score. The data can be the emissions measurements provided by the Carbon optimization blade in the Azure portal, or they can be proxy measurements from non-Microsoft sources or systems.
  2. Export carbon emission data to your data lake.
  3. Use event handlers like Azure Functions or Azure Logic Apps to calculate the SCI score. The output is the amount of carbon emitted in grams per unit, where unit refers to the application scaling factor, or an approximation of it that's based on proxies.
  4. Use technologies like Azure Functions, Logic Apps, or Azure Automation runbooks to trigger demand shaping on the application or to initiate the application's predefined eco mode.
  5. Use Power BI to report and visualize the score and its variation over time.

Components

  • The Carbon optimization blade in the Azure portal provides carbon emission measurements of Azure workloads at resource-group level.
  • The Cloud for Sustainability API provides the underlying data for carbon optimization. You can use it to retrieve information on your subscription's emissions.
  • Application Insights is a feature of Azure Monitor that provides application performance management (APM). It can help you gain powerful insights into how people use your app. You can use this knowledge to make data-driven decisions about improving your application's efficiency.
  • Azure Blob Storage stores the emission information from Azure carbon optimization, from custom calculations, or from other proxies for emissions.
  • Azure Data Lake is a centralized repository that ingests and stores large volumes of data in its original form. The data can then be processed and used as a basis for various analytics needs.
  • Azure Logic Apps enables you to create and run automated workflows with little to no code. By using the visual designer and selecting from prebuilt operations, you can quickly create a workflow that integrates and manages your proxy sources, data storage, and efficiency calculation systems.
  • Azure Functions enables you to run small units of code. It automatically scales resources based on demand, and you pay only for the actual execution time. You can use it to make sustainability calculations and store them in Blob Storage or a data lake.
  • Azure Automation provides process automation via runbooks. You can use the runbooks to implement complex logic, by using PowerShell code, that can influence your application to improve efficiency. This service can also add business value by reducing errors and operational costs.
  • Power BI allows you to turn your data into analytics and reports that provide real-time insights.

Alternatives

The Azure services described in this article can be replaced with similar services. To increase density and utilization of existing resources, perform the calculations with the minimum impact to your infrastructure by using Azure services or tools that are already deployed in your environment:

  • You can substitute Power BI dashboards with Azure Monitor workbooks or Azure Managed Grafana services.
  • You can substitute Application Insights with any other application performance management (APM) tool, like Elasticsearch application performance management (APM) or OpenAPM.
  • Data in the form of tables or unstructured data can be retained in any system of records, like MySQL or MariaDB, or Azure Cosmos DB and MongoDB.
  • If you have a running Azure Functions or Logic Apps space, you can run the calculation on a regular basis from your existing deployments.
  • If the application resources are distributed across multiple resource groups, you can use tags to correlate cost data and calculate the amount of carbon emitted by the application.

Scenario details

This architecture is designed to gather carbon optimization data from Azure and other sources to provide a comprehensive view of an application's environmental impact. Data is collected from Azure carbon optimization. For non-Azure environments, a proxy is used to retrieve relevant carbon metrics. After the data is consolidated, SCI calculations are performed to assess the overall carbon footprint. The results are then stored in an Azure Storage account or data lake for long-term retention, which enables BI analysis and historical reporting. This approach ensures centralized tracking of carbon impact across diverse infrastructure and supports strategic sustainability efforts.

The carbon emissions information is partially gathered from the Azure portal Carbon optimization blade and partially calculated, when possible, via proxy.

Screenshot of the Carbon optimization blade.

It's essential to use a separate architecture to gather Azure carbon optimization data for two key reasons:

  • Azure carbon optimization data is stored and displayed only for the past twelve months (in a rolling window). When long-term tracking of a carbon footprint is required, a dedicated system ensures the retention of detailed historical information.
  • An application might span multiple infrastructures, with Azure as only one component. A separate architecture enables centralized monitoring of carbon impact across all environments to provide a holistic view and ensure more comprehensive sustainability insights.

Note

Greenhouse gases aren't made up of only carbon dioxide, and they don't all have the same impact on the environment. For example, one ton of methane has the same heating effect as 80 tons of carbon dioxide. In this article, everything is normalized to the CO2-equivalent measure. All references to carbon refer to the CO2-equivalent.

Data sources

In general, you should create a proxy equation with few variables. The proxy metrics that you choose should represent the application's behavior and performance.

These metrics are used in this example:

  • The carbon emission of the infrastructure, which is retrieved from the carbon emissions API. This API is the source for both the Emissions Impact Dashboard and the Carbon optimization blade in the Azure portal. The data is available at resource group level, which makes it easier to track your application's emissions.
  • Performance and scale metrics of the application, collected from Application Insights:
    • The scaling factor (API calls, server requests, or some other metric) for the application
    • CPU usage
    • Memory usage
    • Response time (send and receive)

For a tutorial on how to set up Application Insights to get the required metrics, see Application Insights for ASP.NET Core applications.

You can add other variables to the equation, such as:

  • Edge services and infrastructure carbon emissions.
  • The time when users connect, as electricity production and demand vary with time.
  • Any other metric of the app that can tell you how its performance changes over time.

By building this equation into a score that can also reflect the number of users, you create the closest approximation to a carbon score. This score is your benchmark for any further change and improvement toward the sustainability of the app.

Cost is another consideration that's associated with application performance. In most cases, a direct correlation between performance efficiency and cost and carbon savings can be established. This correlation leads to the following assumptions:

  • When performance is higher but costs are the same, you have optimized the app and reduced carbon emissions.
  • When costs are reduced but performance is the same, you have optimized the app and reduced carbon emissions.
  • When both performance and costs increase, you haven't optimized the app, and you have increased carbon emissions.
  • When costs increase but performance is reduced or the same, you haven't optimized the app and have increased carbon emissions (or the energy cost is higher, which is also a cause for higher carbon emissions).

This correlation between the SCI score, cost, and performance of an application is unique for every application and depends on many factors. By gathering data for these three variables, you can create an algorithm of correlation that allows you to forecast any variation of the three, and to make informed decisions on the application architecture and patterns.

Calculations

In the scenario described here, it's not possible to form a discrete calculation for the proxies that are used. Instead, the data gathered from the Emissions Impact Dashboard is processed as a starting point. Here's the SCI baseline calculation:

SCI = C*R

In this equation:

  • C is the carbon emissions for the application. This value is affected by how the application is deployed on Azure. For example, if all the application resources are in a single resource group, C is the carbon emissions for that resource group.

    Note

    For now, other sources of emissions for the application are ignored because they depend on the architecture and edge/user behavior. If you use proxies for data, you can consider these sources in the next step.

  • R is the scaling factor for the application. This can be the number of average concurrent users for the time window, API requests, web requests, or some other metric. This value is important because it leads to a score that accounts for the overall impact of the usage of the application, not just its deployment footprint.

The time window is, of course, another important aspect of this calculation: carbon emissions for any energy-consuming device or system vary over time, because the energy grid might have renewable or alternative energy sources (for example, solar power) at some times but not at others. It's therefore important to start with the shortest possible timeframe to increase precision. For example, you might start with a daily or hourly calculation.

The carbon emissions API currently provides monthly carbon information based on the services within a subscription, at the resource group level. By using the provided REST API, you can export emissions data to a data lake that holds all sustainability data for the application.

Data storage

You should store the carbon and carbon proxy information that you gather in a solution that you can connect to dashboards or reports. Doing so enables you to visualize your carbon score over time and make informed choices. To improve sustainability and align with Azure Well-Architected Framework best practices, we recommend that you use the minimum viable system. (For more information, see Data and storage design considerations for sustainable workloads on Azure and Application platform considerations for sustainable workloads on Azure.) Azure Data Lake Storage is used in this architecture.

Data correlations

When you start gathering data on the carbon, performance, and cost of your application, you'll have valuable information that allows you to create a correlation algorithm that's specific to your application and that will provide guidance when you plan for cost, performance, and carbon optimization.

For more information, see How to select algorithms for Azure Machine Learning.

Data display

You can display your data and calculations in various ways, including customized Azure Monitor dashboards and simple Power BI dashboards.

What can your SCI score trigger?

After you know your sustainability score, you might wonder how you can improve it.

If you can score your application's carbon impact by using proxies, the next step is to focus on defining actions that can be triggered by unfavorable conditions in the score. Some examples of these conditions are:

  • Energy production and demand is at an all-time high and is therefore expensive to produce.
  • Electricity isn't available. This condition could be caused by a natural disaster or geopolitical conflict.
  • Sudden unavailability of edge infrastructure caused by resource over-consumption or supply chain issues.

When you can identify the failure points that can affect your application, you can decide what actions to take to make your application resilient to carbon spikes.

You can take one of the following actions:

  • Apply a graceful degradation of the app's services and features, as described in the Well-Arcchitected Framework documentation.

  • Create an eco-mode version of your application. Eco mode is a simpler, smaller, cheaper, more sustainable version of the application that offers minimal features. You can revert to this version during carbon emission spikes. You can also simply train your end users to use an eco version by choice. You can provide a "green button" that enables people to use a leaner interface, fewer graphics, and limited features in exchange for reduced carbon emissions.

  • If you choose to involve your users, you create an opportunity to drive a cultural change along with the technical one: - You can specify the impact of the choice: "By using the eco version, you save x amount of carbon" or "bringing our carbon score to y amount." - You can gain an understanding of user behavior and modify the eco version to reflect their choices. (Maybe they use 10% of the features and are an ideal user of the eco version.) - As the full version is optimized for emission, you can ideally eventually merge the two versions.

Considerations

These considerations implement the pillars of the Azure Well-Architected Framework, which is a set of guiding tenets that can be used to improve the quality of a workload. For more information, see Microsoft Azure Well-Architected Framework.

Security

Security provides assurances against deliberate attacks and the abuse of your valuable data and systems. For more information, see Overview of the Security pillar.

For additional security, you can use Azure Virtual Network service endpoints to remove public internet access to Azure service resources, allowing traffic only from your virtual network.

With this approach, you create a virtual network in Azure and then create private service endpoints for Azure services. Those services are then restricted to traffic from that virtual network. You can also reach them from your on-premises network via a gateway.

Keep in mind that, in order to move data from on-premises into Azure Storage, you need to allow public IP addresses from on-premises or Azure ExpressRoute. For details, see Deploy dedicated Azure services into virtual networks.

For general guidance on designing secure solutions, see Azure security documentation.

Cost Optimization

Cost Optimization is about reducing unnecessary expenses and improving operational efficiencies. For more information, see Overview of the Cost Optimization pillar.

You can deploy this architecture by using several alternative Azure services. It was intentionally kept at a minimum to save on cost and carbon emissions.

While we encourage you to use equivalent services that you already have in your application deployment, pricing information is available for each architecture component:

Performance Efficiency

Performance Efficiency is the ability of your workload to scale to meet the demands placed on it by users in an efficient manner. For more information, see Performance Efficiency pillar overview.

The primary purpose of this architecture is to provide a sustainability score for your applications via a process that has a minimal impact on cost and carbon. Most of the components are platform as a service (PaaS) and serverless Azure services that can scale independently based on use and traffic.

In this scenario, the dashboard and storage interface aren't intended for massive usage and consultation. If you plan to provide it to a large number of users, you might want to consider one of these options:

  • Decouple the extracted data by transforming it and storing it in a different system.
  • Replace Data Lake Storage or Azure Table Storage with a more scalable data structure alternative, like Azure Cosmos DB.

Contributors

This article is maintained by Microsoft. It was originally written by the following contributors.

Principal authors:

  • Paola Annis | Principal Customer Experience Engineering Manager
  • Davide Bedin | Senior Cloud Solution Architect, Application Innovation

Other contributor:

To see non-public LinkedIn profiles, sign in to LinkedIn.

Next steps

This article is aligned with the principles and methodology of the Green Software Foundation. The next step in creating a greener application is to embed the Carbon Aware SDK into your application so that triggers can be automated in real time when specific carbon conditions are met.

For recommendations to optimize Azure workloads, see Sustainability cloud workload guidance.