Most energy efficient Core on a private Telco Cloud: Energy optimized redundancy model for telco applications
Kubernetes, Energy Efficiency, 5G Core Network
Description
Motivation:
Deutsche Telekom is operating and constantly developing and improving its own cloud to operate internet and telephony services. The Kubernetes Cloud and the Telco applications are combined to form a TaaP – Telco as a Platform. The TaaP are thousands of servers and hundreds of applications. The energy efficiency of the TaaP is a key success criterion in order to optimize costs, energy consumption, and carbon emissions. Hence the concept of Full Stack Energy Management is established. The focus is to optimize hardware, software and services towards energy efficiency without affecting service availability and robustness.
Problem & Challenge:
In the Telco industry, so far, HW redundancy has been the baseline for service robustness and resilience. The introduction of virtualization and containerization concepts resulted in an additional redundancy level above the hardware. Classical redundancy models don’t apply to this multi-layer redundancy any longer. Moreover, there is no mathematical model that calculates the service availability for such a case.
Specific Problem Formulation:
On a TaaP there are multiple layers of redundancy in Hardware and Software. On the one hand, there are multiple site deployments, where each site has multiple hundreds of servers. On the other hand, on each site, each server has multiple redundant hardware parts like power supply. Moreover, a Kubernetes Cluster, which is homed on one site, hosts multiple microservices, each with a different redundancy concept like active/passive, n+1, n+m, etc. This setup of mixed HW and SW redundancy causes inefficiency and is not easy to calculate or simulate in terms of overall service, network, site, redundancy, and energy consumption.
Solution Approach:
There are multiple different parameters in HW and SW that impact the service availability and energy consumption. Firstly, a comprehensive list of these parameters is required, including modeling of dependencies. Secondly, a model needs to be set up to consider all of these parameters into “one equation”.
Expected Outcome:
A simulation and mathematical model should be developed that considers software and hardware redundancy across multiple sites and SW layers in order to calculate the network-wide service availability. Moreover, the model should allow the optimization of the following parameters: least required HW based on predefined service availability, least energy consumption, and best redundancy.
Prerequisites
- Familiarity with tools such as GitLab and Wiki platforms.
- Proficiency in English. The project language is English and the team spans across four EU countries.
- Basic Kubenetes Knowhow.
- High level of self-engagement and motivation.
Contact
- Manuel Keipert (manuel.keipert@telekom.de)
- Valentin Haider (valentin.haider@tum.de)
- Razvan-Mihai Ursu (razvan.ursu@tum.de)