Luca BarsottiniLuca Barsottini
Data Engineer · Analytics

Building pipelines
that matter

I work at the intersection of data and people. Most data professionals are good at building pipelines. Fewer make sure those pipelines actually change how a business thinks.

That's where I focus.

SOURCERaw EventsSOURCEHR DataSOURCEKPI MetricsINGESTPython · APIsTRANSFORMDataiku · dbtPython · SparkWAREHOUSEBigQueryDASHBOARDSPowerBIINSIGHTSManagementDocker · Git · IaC · CI/CD
Python ↑ activeSQL ↑ activeApache Spark ↑ learningdbt ↑ activeBigQuery ↑ activeDataiku ↑ 3yr+PowerBI ↑ activeGCP ↑ activeDocker ↑ activeKestra ↑ learningAirflow ↑ learningBruin ↑ learningPandas ↑ dailyGit ↑ dailyPython ↑ activeSQL ↑ activeApache Spark ↑ learningdbt ↑ activeBigQuery ↑ activeDataiku ↑ 3yr+PowerBI ↑ activeGCP ↑ activeDocker ↑ activeKestra ↑ learningAirflow ↑ learningBruin ↑ learningPandas ↑ dailyGit ↑ daily
Technical stack
What I work with
3+ years of hands-on experience across the full data lifecycle
Data & Analytics
SQL
Python · Pandas
Data Engineering
Data Analysis
Business Intelligence
Tools & Platforms
Dataiku
PowerBI
Apache Spark
dbt
Google BigQuery
Kestra
Airflow
Bruin
Cloud & Infra
Google Cloud Platform
Pub/Sub
Docker
Git
ETL Architecture
Programming
Python
Java
SQL
REST APIs
HTML · JS
Python95%
PowerBI88%
Dataiku90%
SQL85%
GCP · BigQuery78%
dbt · Spark65%
IN PROGRESS
Data Engineering Certification
Spark · dbt · BigQuery · Cloud Architecture
EDUCATION
Executive Master in Data Science
Rome Business School · 2022 – 2023
MSc Telecommunication & Software Engineering
Politecnico di Milano · 2019 – 2022
BSc Ingegneria delle Telecomunicazioni
Politecnico di Milano · 2012 – 2017
Career
Experience
Consulting across aerospace, insurance, smart metering and telecom↓ Download CV
Data Analyst & Data Engineer
Tinvention · Thales Alenia Space
Dec 2023 – Present
+
Built and maintained Dataiku pipelines consolidating multi-source monthly operational data for reliable reporting
Developed Python logic for workforce and project performance data — delivery timelines, resource allocation, incident tracking
Created validation workflows ensuring stakeholders could trust the numbers before acting on them
Designed PowerBI dashboards tracking core KPIs, used regularly by management for strategic decisions
Primary interface between technical implementation and business stakeholders — translating data into actionable narratives
3+ year client relationship through consistent delivery and honest expectation management
PythonPandasSQLDataikuPowerBIGit
Data Engineer (Junior)
Tinvention · Generali Insurance
Mar – Nov 2023
+
ETL pipeline implementation on GCP, supporting data movement across enterprise systems
Pub/Sub for event-driven data ingestion across distributed systems
Java components for the Load phase of ETL workflows into BigQuery
Artifact deployment and release processes in a structured enterprise environment
JavaGCPBigQueryPub/SubGit
Technical Onboarding
Tinvention
Nov 2022 – Feb 2023
+
Intensive program covering Java backend, SQL, REST APIs, Spring Framework
DevOps basics — Git, Docker, Apache Tomcat, CI practices
Agile methodology and client-facing communication training
JavaSQLSpringDockerGitREST
Telecommunications Software Engineer
2i Rete Gas S.p.A · Smart Metering
Nov 2021 – Nov 2022
+
Defined technical specifications for Electronic Water Meter systems and LoRaWAN Data Concentrators (868MHz network)
Planned RF networks using Python-based coverage calculation models including Ericsson signal degradation models
Applied EU directives (MID, ATEX, RED) and Italian legislation (AEEG) in the smart water metering domain
Laboratory testing, database testing, and technology scouting
PythonLoRaWANRF PlanningSQL
Telecommunications Engineer
ALTEN Italia · Vodafone NOC
Mar 2017 – Aug 2018
+
Network infrastructure supervision at Vodafone's Network Operation Center for GSM, UMTS, and LTE mobile networks
Network performance analysis, KPI reporting, and coordination with vendors (ALU, Ericsson, Huawei, Nokia)
Internship → full role: Vodafone Core & Backbone governance, fixed and mobile network monitoring and incident management
GSM/UMTS/LTEKPI ReportingBMC Remedy
Work
Projects
Open source and personal data engineering work
⬤  In progress
DE Pipeline — Public Dataset
End-to-end data engineering pipeline on a public dataset. Ingestion → transformation → modelling → dashboard. Designed to demonstrate production patterns.
dbtBigQueryPythonGCP
⬤  In progress
Web Scraper + Data Pipeline
Documented scraping pipeline with clean data models downstream. Includes responsible data collection practices and structured output schemas.
PythonPandasSQLDocker
○  Planned
Spark Processing — Large Scale
Distributed data processing project using Apache Spark. Showcasing performance optimisation and partitioning strategies on real-world-sized datasets.
Apache SparkPythonBigQuery
About
How I work

I've spent the last few years embedded in client environments — not just delivering technical work, but understanding the people behind the requests.

I spent as much time explaining data to people as building it — and found that's where the real value gets created. A pipeline no one trusts is a pipeline no one uses.

Currently building toward full-remote opportunities in data engineering and analytics engineering. If you're working on something interesting where clear thinking matters more than impressive jargon, let's talk.

Connect on LinkedIn ↗
3+
Years consulting
2
Enterprise clients
10+
Technologies
🌍
Open to remote