MSEI/MSCE Research Internships

Some of the offered MSEI/MSCE research internships may be offered as tasks that also can be carried out in the context of the Project Lab Integrated Systems. If this applies it is explicitly mentioned in the associated topic description.

Available Topics

Interested in an internship or a thesis?
Often, new topics are in preparation for being advertised, which are not yet listed here. Sometimes there is also the possibility to define a topic matching your specific interests. Therefore, do not hesitate to contact our scientific staff, if you are interested in contributing to our work. If you have further questions concerning a thesis at the institute please contact Dr. Thomas Wild.

Unable to fetch resource from https://tumanager.ei.tum.de/service.php?token=lifecycle_sec_tueilis&mode=open&type=FP with exception: cURL error 60: SSL certificate problem: unable to get local issuer certificate (see https://curl.haxx.se/libcurl/c/libcurl-errors.html) for https://tumanager.ei.tum.de/service.php?token=lifecycle_sec_tueilis&mode=open&type=FP

Assigned Topics

Unable to fetch resource from https://tumanager.ei.tum.de/service.php?token=lifecycle_sec_tueilis&mode=ongoing&type=FP with exception: cURL error 60: SSL certificate problem: unable to get local issuer certificate (see https://curl.haxx.se/libcurl/c/libcurl-errors.html) for https://tumanager.ei.tum.de/service.php?token=lifecycle_sec_tueilis&mode=ongoing&type=FP

Download thesis as PDF

Prioritization Algorithms for a Page Preloading Mechanism

Description

Since DRAM typically come with much higher access latencies than SRAM, many approaches to reduce DRAM latencies have already been explored, such as Caching, Access predictors, Row-buffers etc.

In the CeCaS research project, we plan to employ an additional mechanism, in detail a preloading mechanism of a certain fraction of the DRAM content to a small on-chip SRAM buffer. Thus, it is required to predict potentially next-accessed Cachelines, preload them to the SRAM and answer subsequent memory requests of this data from the SRAM instead forwarding them to the DRAM itself. Potential candidates of pages to be preloaded need to be prioritzied and in case the memory bandwidth is not sufficiently large, the least important pages have to be skipped.

This functionality should be implemented as a cycle accurate VHDL model. A baseline system will bw provided, the goal is to extend this functionality by meaningful metrics. Depending on the progress, this can be extended or refined in subsequent steps.

A close supervision, especially during the inital phase, will be guaranteed. Nevertheless, some experience with VHDL and C programming is required.

Prerequisites

Strong Experience with VHDL Coding
Basic Knowledge is C Programmng
Basic knowledge on MPSoC, cache hierarchies etc.
B.Sc. in Electrical Engineering or similar

Contact

Oliver Lenke

o.lenke@tum.de

Supervisor:

Oliver Lenke

Student

Jakob Winterer

Download thesis as PDF

High-Level Simulation of Chiplet Architectures

Description

In the BCDC project, a working group at TUM collaborates on designing a RISC-V-based chiplet demonstration chip, of which at least two will be connected via an interposer to simulate a system of interconnected chiplets. At LIS, we work on a high-performance, low-latency chiplet interconnect with additional application-specific features managed by a smart protocol controller. It closes the gap between the underlying physical layer that takes care of data transmission across the interposer, and the system bus that attaches the inter-chiplet interface to the other components of the demonstration chip.

A high-level simulation of our system should be set up during this research internship to investigate the viability and performance of different architecture configurations. The simplest topology involves two connected identical chiplets; more complex arrangements could consist of more chiplets or other architectural elements, like an FPGA. The chiplets should be abstracted to mainly generate and process data in manners like the RISC-V CPU cores and further processing units attached to the AXI bus. A bus functional model should represent the interconnect and simulate it regarding transmission width, throughput, and latency. The modeled interconnect standard, for example, UCIe, PCIe, or modified versions of MII or SPI, and the level of modeling detail are to be explored.

As a first step, approaches to simulate specifically chiplet architectures should be researched theoretically. After choosing a suitable framework, e.g., SystemC or Matlab/Simulink, the system model should be created, and different configurations should be investigated. Ultimately, the simulation should help identify the benefits and drawbacks of these configurations and support a future HDL implementation.

Prerequisites

Basic understanding of chiplet architectures
Experience with high-level simulation
Structured and independent way of working and strong problem-solving skills

Contact

michael.meidinger@tum.de

Supervisor:

Michael Meidinger

Download thesis as PDF

High-Performance Hardware Tracing of SmartNIC Packet Processing Pipelines

Description

With the advent of research on the next generation of
mobile communications 6G, we are engaged in exploring
architecture extensions for Smart Network Interface Cards
(SmartNICs). To enable adaptive, energy-efficient and
low-latency network interfaces, we are prototyping a
custom packet processing pipeline on FPGA-based NICs,
partially based on the open-nic project
(https://github.com/Xilinx/open-nic).

Modern server architectures face constant challenges in
performance and energy efficiency. SmartNICs offer a
promising solution by offloading packet preprocessing and collecting real-time traffic analytics. These capabilities allow servers to dynamically adapt to changing network conditions and processing demands. However, operating at speeds of 100 Gbps generates massive data volumes that require sophisticated monitoring and debugging capabilities.

This thesis focuses on designing and implementing advanced hardware extensions for debugging and tracing SmartNIC packet processing pipelines using Hardware Description Language (HDL). The developed system will provide critical visibility into high-speed packet processing operations and monitoring logic.

Developing trace collection mechanisms compatible with 100 Gbps line rates
Engineering efficient solutions for capturing, moving, and storing large volumes of trace data
Implementing strategies to avoid performance degradation during trace collection
Applying suitable postprocessing and generating visualizations of key information

Prerequisites

Programming skills in VHDL/Verilog, C, Python and preferably Rust
Practical experience with FPGA Design and Implementation
Good Knowledge of computer architecture, low-level software and OSI network model
Comfortable with the Linux command line and bash

Contact

Marco Liess, M.Sc.
Tel.: +49.89.289.23873
Email: marco.liess@tum.de

Supervisor:

Marco Liess

Download thesis as PDF

Localizing Automotive Diagnostic Solutions: Software Migration and PS/PL Interface Implementation on ZCU102

Description

About the Project:
Future cars rely on a wide variety of sensors—including cameras, LiDARs, and RADARs—that generate enormous amounts of data. This data flows through the intra-vehicular network (IVN) to processing nodes, ultimately triggering actuators. With strict timing constraints essential for vehicle safety, time-sensitive networking (TSN) is now a critical component in modern automotive systems. Within the context of the EMDRIVE project, our team is developing new monitoring and diagnostic approaches to detect errors early and maintain functional safety in highly automated driving environments.

Project Description:
The primary goal of this project is to migrate existing software packages—used to record ECU traces and analyze processing anomalies—onto the ZCU102 board. This migration will enable local processing of anomalies and establish a robust PS/PL interface between the anomaly detection hardware (implemented on the FPGA) and the processing system running the software.

The key tasks include:

TAS Tool Configuration: Bring up the TAS tool and configure it to work with the Multi Core Debug Solution (MCDS) for trace recording.
Trace Analyzer Deployment: Bring up and configure the Trace Analyzer to parse recorded traces and detect deviations in processing.
Software Migration: Migrate the existing software packages to run on the Processing System (PS) of the ZCU102 board.
Interface Integration: Develop and integrate a stable interface between the Programmable Logic (PL) and the PS, ensuring efficient sharing of data, status, and configuration information.

Key Responsibilities:

Analyze existing software packages and understand the hardware integration requirements.
Configure and validate both the TAS tool and the Trace Analyzer.
Adapt and optimize software for deployment on the ZCU102 board.
Develop and implement a robust PS/PL interface for seamless communication between hardware and software.
Collaborate with interdisciplinary teams to integrate and test the complete system.

Prerequisites

Required Skills:

Proficiency in C programming.
Strong understanding of System-on-Chip (SoC) architectures and microcontroller modules.
Background in automotive applications and systems.
Experience with hardware description languages (e.g., VHDL) and embedded systems (preferred).
Familiarity with Linux-based systems and FPGA integration is a plus.

Benefits:

Hands-on experience with cutting-edge automotive diagnostic technology.
Exposure to advanced hardware-software integration and embedded systems.
Opportunity to contribute to projects that enhance the safety and reliability of future vehicles.
Collaborative work environment with industry-leading partners.

Contact

Zafer Attal

zafer.attal@tum.de

Supervisor:

Zafer Attal

Download thesis as PDF

Profiling-based Prefetcher Design

Description

In memory hierarchy, multi-levels caches are used to cache datas in order to avoid the long access latency when accessing to the DRAM. However, when cache misses happen, the long memory access latency will still stall the program execution. To further improve the performance, prefetching techniques are widely used in our modern processors. A prefetcher predict and fetch the data to cache/buffer before it is actually accessed, thereby hiding memory access latency.

Our prefetcher reacts to cache load misses by prefetching large memory regions. While simple, this can severly burden the DRAM bandwidth and flood the buffer, especially when many of those prefetched regions are not actually needed.

Applications exhibit varied memory access patterns. Some memory regions show some characteristics that they are better candidates for prefetching. By profiling an program in advance, it is possible to determine which memory region should be prefetched and which memory region should be evicted earlier.

In this internship, the student will help to implement the prefetching priority and eviction policy in our existing SystemC model. And by using the profiling result in the policy, we expect to get a performance improvement compared to the original model.

Prerequisites

Basic computer architecture knowledge
Experience with Python programming
Better if have SystemC knowledge.

Supervisor:

Yuanji Ye

Student

Yuxuan Li

MSEI/MSCE Research Internships

Available Topics

Assigned Topics

FP: Prioritization Algorithms for a Page Preloading Mechanism

Prioritization Algorithms for a Page Preloading Mechanism

Description

Prerequisites

Contact

Supervisor:

Student

FP: High-Level Simulation of Chiplet Architectures

High-Level Simulation of Chiplet Architectures

Description

Prerequisites

Contact

Supervisor:

FP: High-Performance Hardware Tracing of SmartNIC Packet Processing Pipelines

High-Performance Hardware Tracing of SmartNIC Packet Processing Pipelines

Description

Prerequisites

Contact

Supervisor:

FP: Localizing Automotive Diagnostic Solutions: Software Migration and PS/PL Interface Implementation on ZCU102

Localizing Automotive Diagnostic Solutions: Software Migration and PS/PL Interface Implementation on ZCU102

Description

Prerequisites

Contact

Supervisor:

FP: Profiling-based Prefetcher Design

Profiling-based Prefetcher Design

Description

Prerequisites

Supervisor:

Student