SEB has chosen a Cloud First strategy based on the Google Cloud Platform. It is a pretty big change for a bank that has been self-sufficient in IT for around 50 years! But how do we track our transition to the Cloud? One way is to measure and work with KPIs, or "Key Performance Indicators".
This blog post is based on a presentation given by the authors at the Build Stuff Conference on November 19, 2021.
Peter Ferdinand Drucker, author of 39 books on the topic of Business Management, is credited with the quote "If you can't measure it, you can't improve it". Businesses use KPIs to monitor whatever drives growth. A good KPI is specific, measurable, and relevant to overall business goals.
We identified and adopted a few principles:
- Agree on a limited set of KPIs. If you have too many, you risk losing focus, and there will be administration that no one cares for!
- Do not build a burden. Automate data collection, and everything else, as much as possible.
So, if we want to measure our transition to the Cloud, what makes sense for us to measure? Here are some examples of indicators we came up with:
- Ratio of applications deployed to the Cloud versus traditional on-premises solutions. Cloud deployments should increase while on-premises deployments decrease (see the sketch after this list).
- Usage of managed versus less managed services. Cloud is not about using someone else's computer (it never was your computer anyway). It is about using high-end services at a reduced cost!
- Cost and number of projects and users over time.
- Carbon footprint. Developers and technicians can reduce the carbon footprint by making informed design decisions!
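As a minimal sketch of the first indicator, the ratio boils down to simple arithmetic over two counts. The numbers below are illustrative placeholders, not our actual figures:

```python
# Hypothetical sketch: compute the cloud-vs-on-premises deployment ratio KPI.
cloud_deployments = 120    # placeholder: applications running in Google Cloud
on_prem_deployments = 480  # placeholder: applications still on-premises

# Share of all applications that run in the cloud, as a percentage.
cloud_ratio = cloud_deployments / (cloud_deployments + on_prem_deployments)
print(f"Cloud ratio: {cloud_ratio:.1%}")  # -> Cloud ratio: 20.0%
```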
Solution overview
This is the design we came up with for the Cloudification KPI application.
The application consists of four main modules:
- Data sources.
- Data gathering functions.
- Database.
- Data visualization.
The first module consists of the different information sources we employ in the application. Some of the data is retrieved directly from the cloud, while the rest is obtained from on-premises services.
For the cloud-based data, Cloud Functions are responsible for retrieving and filtering the necessary information from our data sources and storing it in our database in the proper format. Cloud Scheduler triggers these functions daily. A different approach is required for on-premises data, as it is not reachable from within the cloud. A Kubernetes CronJob running on-premises retrieves this information and sends it to the Google Cloud Platform environment on the same daily schedule.
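To illustrate, a daily collection function might look something like the sketch below. The `functions_framework` HTTP entry point and the Firestore client are real APIs, but the collection name and the `count_deployments` helper are hypothetical stand-ins for our internal inventory systems:

```python
import datetime

import functions_framework
from google.cloud import firestore

db = firestore.Client()


def count_deployments() -> dict:
    """Hypothetical helper: query an internal inventory source and
    return the number of cloud and on-premises deployments."""
    return {"cloud": 120, "on_prem": 480}  # placeholder values


@functions_framework.http
def collect_kpis(request):
    """HTTP-triggered Cloud Function, invoked daily by Cloud Scheduler."""
    today = datetime.date.today().isoformat()
    counts = count_deployments()
    counts["cloud_ratio"] = counts["cloud"] / (counts["cloud"] + counts["on_prem"])

    # One Firestore document per day keeps the history easy to query.
    db.collection("kpi_snapshots").document(today).set(counts)
    return f"Stored KPI snapshot for {today}", 200
```

The on-premises CronJob would run a similar script, except that it pushes its results over HTTPS into the cloud environment rather than writing to Firestore directly.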
Once all data is available in Google Cloud, it is stored in Firestore, a NoSQL database that stores documents in JSON format. This provides faster and more cost-efficient access than querying each of the source systems directly.
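Reading the aggregated snapshots back out is then a single query against one collection. Again, just a sketch, reusing the hypothetical collection name from above:

```python
import pandas as pd
from google.cloud import firestore

db = firestore.Client()

# Stream all daily snapshot documents into a DataFrame; the document ID
# is the ISO date on which the snapshot was taken.
rows = [{"date": doc.id, **doc.to_dict()}
        for doc in db.collection("kpi_snapshots").stream()]
df = pd.DataFrame(rows).sort_values("date")
```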
Once all the data has been aggregated in Firestore, our web application can retrieve everything it needs from one location. Data manipulation is done server-side using the Pandas Python library, and the frontend visualization is handled by Dash, a web framework built on top of the Plotly visualization library. We will go into more detail on these later in the article.
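To show how the pieces fit together, here is a minimal Dash app that plots a cloud-ratio time series. The hard-coded rows stand in for the Firestore-backed DataFrame from the previous sketch; the layout and chart are illustrative, not our production dashboard:

```python
import pandas as pd
import plotly.express as px
from dash import Dash, dcc, html

# In the real application this DataFrame comes from Firestore (see the
# previous sketch); a few hard-coded rows keep the example self-contained.
df = pd.DataFrame({
    "date": ["2021-09-01", "2021-10-01", "2021-11-01"],
    "cloud_ratio": [0.15, 0.18, 0.20],
})

fig = px.line(df, x="date", y="cloud_ratio",
              title="Share of applications deployed to the cloud")

app = Dash(__name__)
app.layout = html.Div([
    html.H1("Cloudification KPIs"),
    dcc.Graph(figure=fig),
])

if __name__ == "__main__":
    app.run_server(debug=True)
```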
It is worth highlighting that the entire application setup and configuration is done as code: we use Terraform for infrastructure provisioning, driven by a GitLab CI/CD pipeline.
Below is an overview of the application: