THESIS: From Raw Data to Insights: Comparing Modern Data Engineering Tools
Join us for your thesis work! Gain hands-on experience, work on real projects, and develop your skills in a supportive and innovative environment!
High level description
Data engineering is the process of refining raw data into a usable state, for example by transforming raw CSV or JSON files into structured formats ready for analysis. There are many platforms and tools available to support this process, each with different trade-offs in terms of performance, scalability, and usability. This thesis will explore and compare modern data engineering platforms by applying them to real-world open datasets (such as weather data).
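To make "refining raw data into a usable state" concrete, here is a minimal Python sketch that turns raw CSV text into typed records. The column names, the `Observation` type, and the sample values are invented for illustration and are not part of any dataset or platform named in this posting.

```python
import csv
import io
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Observation:
    """One structured weather reading (hypothetical schema)."""
    station: str
    date: str
    temp_c: Optional[float]  # None when the raw value is missing


# Invented sample input; a real pipeline would read files or call an API.
RAW_CSV = """station,date,temp_c
Sundsvall,2024-01-01,-5.2
Sundsvall,2024-01-02,
Stockholm,2024-01-01,-1.0
"""


def parse_observations(raw: str) -> List[Observation]:
    """Parse CSV text into typed records, mapping empty temperatures to None."""
    records = []
    for row in csv.DictReader(io.StringIO(raw)):
        text = (row["temp_c"] or "").strip()
        records.append(
            Observation(
                station=row["station"].strip(),
                date=row["date"].strip(),
                temp_c=float(text) if text else None,
            )
        )
    return records


observations = parse_observations(RAW_CSV)
```

On a cloud platform the same step would typically be expressed in that platform's own tooling; the sketch only shows the shape of the transformation.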
Who are we looking for?
Students pursuing a Bachelor or Master of Science in Computer Science or Computer Engineering.
Project description
This thesis aims to evaluate and compare different cloud-based data engineering platforms. The work will involve building end-to-end data pipelines — from ingestion of raw open datasets (e.g., weather data) to transformation, storage, and analysis — and systematically comparing the platforms across a set of criteria. Examples of relevant platforms include Databricks, AWS-native analytics tools, or similar technologies used in industry.
Purpose and Scope
In this thesis, you will investigate the following areas:
- Data ingestion & storage – Investigate how different platforms handle ingestion and storage of raw datasets (APIs, CSV/JSON files, large historical archives).
- Data processing & transformation – Implement cleaning, aggregation, and enrichment workflows, and compare efficiency and flexibility.
- Query & analytics performance – Measure and analyse query execution times and scalability for simple and complex analytical queries.
- Cost & resource utilization – Estimate and compare the cost implications of running equivalent workloads.
- Developer experience & integration – Evaluate usability, debugging, and integration with complementary tools (e.g., BI dashboards, machine learning frameworks).
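As a rough illustration of the processing and query-performance areas above, the following Python sketch aggregates a small set of readings and times the aggregation over several runs. It is a platform-neutral sketch under stated assumptions: the function names `mean_temp_by_station` and `time_query`, the repeat count, and the sample data are all hypothetical, and a real comparison would run equivalent workloads on each platform under test.

```python
import statistics
import time
from collections import defaultdict


def mean_temp_by_station(records):
    """Aggregate (station, temp) pairs into a mean per station, skipping missing values."""
    temps = defaultdict(list)
    for station, temp in records:
        if temp is not None:  # cleaning step: drop missing readings
            temps[station].append(temp)
    return {station: statistics.fmean(values) for station, values in temps.items()}


def time_query(fn, *args, repeats=5):
    """Run fn several times and return its last result and the median wall-clock time."""
    timings = []
    result = None
    for _ in range(repeats):
        start = time.perf_counter()
        result = fn(*args)
        timings.append(time.perf_counter() - start)
    return result, statistics.median(timings)


# Invented sample data for illustration only.
SAMPLE = [
    ("Sundsvall", -5.0),
    ("Sundsvall", None),
    ("Sundsvall", -3.0),
    ("Stockholm", -1.0),
]

result, median_seconds = time_query(mean_temp_by_station, SAMPLE)
```

Reporting the median of repeated runs rather than a single measurement reduces the influence of one-off delays, which matters when comparing timings across platforms.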
An Exciting Journey with Knightec Group
Semcon and Knightec have joined forces as Knightec Group. Together, we are Northern Europe’s leading strategic partner in product and digital service development. With a unique combination of cross-functional expertise and a holistic business understanding, we help our clients realize their strategies – from idea to complete solution.
Practical Information
This is a thesis position located at our office in Sundsvall. The start date is January or March 2026.
Please submit your application as soon as possible, but no later than 2025-11-30. If you have any questions, you are welcome to contact Johanna Edström. Note that due to GDPR, we only accept applications through our careers page.
- Business unit: Thesis
- Role: Bachelor thesis
- Locations: Sundsvall
