Felipe Altermann
Senior Data Analyst
projects

Databricks Incremental
Data Ingestion

The project explores modern data engineering in Databricks, using a star schema and the Medallion Architecture for incremental ingestion via Spark Streaming and Auto Loader. It features dynamic job orchestration, declarative Lakeflow pipelines, and automated SCD/fact builders, culminating in a KPI Databricks Dashboard for a simulated stakeholder.

Documentation

 tools

 code

Regional Aviation
in Brazil

Capstone project at Ironhack Data Analytics Bootcamp. It moves from answering the question 'what is a regional airport?' to an in-depth analysis of the industry in Brazil, putting machine learning into action with international aviation metrics to group airports into clusters.

Storytelling

 tools

 code

"meat, the future?"
Tableau Viz

A #viz4ClimateAction to make us (yeah, me and you!) aware of the explosive worldwide growth in meat consumption and its huge share of total greenhouse gas emissions, so that we wake up and take personal action.

Storytelling

 tools

 code

Where in Australia to build a 'shark-free' family resort?

Shark Attacks (data cleaning and manipulation with Pandas) was my first project at Ironhack's Data Analytics Bootcamp. The given dataset was extremely messy and dirty, so the main Pythonic challenge here was to get it clean and usable. But before starting to transform beast into beauty, I was also challenged to develop a story based on a business question to answer.

Storytelling

 tools

 code

Linear Regression: Diamonds Price Prediction

Work with data to understand which characteristics of a diamond are most likely to influence its price. Rick, our client, has 5,000 diamonds and asked us to estimate their prices based on a historical dataset with over 54,000 diamond prices. Our assignment is to estimate the price of Rick’s 5,000 diamonds with the smallest possible error, so he can sell them properly. We will specifically measure the root mean squared error (RMSE) of our predictions.
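For readers unfamiliar with the metric, here is a minimal sketch of how RMSE can be computed. It is not the project's actual code: the function name `rmse` and the sample prices are hypothetical and chosen only for illustration.

```python
import math


def rmse(actual, predicted):
    """Root mean squared error between two equal-length sequences of prices."""
    if len(actual) != len(predicted) or len(actual) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    # Square each prediction error, average the squares, then take the square root.
    squared_errors = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical diamond prices, made up for this illustration only.
actual_prices = [500.0, 1200.0, 3400.0]
predicted_prices = [450.0, 1300.0, 3400.0]
print(rmse(actual_prices, predicted_prices))
```

Because the errors are squared before averaging, a few large misses weigh more heavily than many small ones, which suits a task where badly mispricing a single diamond is costly.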

 tools

 code

Google Cyclistic Bike Rental Case Study

How do annual members and casual riders use Cyclistic bikes differently? My analysis for the Google Data Analytics Professional Certificate capstone project.

Storytelling

 tools

 code

certifications

Data Engineer

Associate


by Databricks

Issued:
May 17, 2025

See credential

Data Analyst

Associate


by Databricks

Issued:
September 30, 2024

See credential

Databricks

Fundamentals


by Databricks

Issued:
May 19, 2024

See credential

Data Engineer

in Python

by DataCamp

Issued:
November 3, 2023

See credential

dbt

Fundamentals


by dbt Labs

Issued:
June 27, 2023

See credential

Data Analytics

Bootcamp


by Ironhack

Issued:
January 14, 2022

See credential

Data Analytics

Certificate


by Google

Issued:
May 17, 2021

See credential

Data Analyst

in Python


by DataCamp

Issued:
November 7, 2021

See credential

technical skills
Languages & Data Tools (ETL/ELT)
Python • R • SQL • Pandas • NumPy • Scikit-learn • Airflow • SSIS • Git • Docker • dbt


Databases & Warehousing
PostgreSQL • MySQL • SQL Server • Snowflake • Redshift • BigQuery • DynamoDB • MongoDB


Cloud & Platforms
AWS (e.g., S3, Glue, Lambda, RDS) • Azure • GCP (e.g., Dataflow, Dataproc) • Databricks


BI & Visualization
Power BI • Looker • Tableau • QuickSight • Matplotlib


Other Tools & Techniques
HTML • CSS • Microsoft Excel • Statistical Analysis • Linear Regression


published articles
Article #1
Data Visualization with Databricks Series
Quickly Visualize and Understand Your Data
Published:
June 4, 2024
Read on Medium
Article #2
Data Visualization with Databricks Series
Dashboards to Better Share Data Insights with Your Stakeholders
Published:
November 7, 2024
Read on Medium
Article #3
Data Visualization with Databricks Series
Storytelling: Go Beyond Showing Data — Tell a Story With It!
Published:
March 18, 2025
Read on Medium