Hi, I'm Stephanie Choi

Python Software & Data Engineer 👩🏻‍💻

I build scalable data pipelines and backend services. Passionate about writing clean, maintainable code, automating processes, and helping clients modernize their tech stack with efficient, future-ready solutions.

PythonSnowflakeDBTApache AirflowPostgreSQLDockerAWSDagsterPolarsESP32MQTT

Featured Projects

A collection of personal projects showcasing my work in data engineering and software development.

Data Pipeline Orchestrator (DBT+Airflow+Snowflake)
Scalable ETL pipeline for processing large datasets utilising DBT+Airflow+Snowflake, includes data quality checks and monitoring.
PythonDBTApache AirflowSQLAWS S3CosmosSnowflake
Data Pipeline Orchestrator (Dagster)
Scalable ETL pipeline for processing large datasets from Dagster, includes data quality checks and monitoring.
PythonDagsterSQLDockerAWS S3Polars
Log Analysis Tool
CLI tool for parsing and analyzing application logs. Features pattern matching, anomaly detection, and automated report generation.
PythonRegexBash Scripts
Taiwan Stock Exchange Data
Extract daily information from TWSE and pre-process before ingesting the information to database.
PythonposgresqlRabbitMQCeleryPydanticmatplotlibfastAPIDigitalOceanPandas
Trading Journal Application
Day trading journal platform where users can log, track, analyze trade performance with visual dashboard using plotly Dash and tagging trade comments feature.
PythonPlotly DashSQLDjango

Latest Blog Posts

Recent thoughts and technical articles

The views I express here are mine alone and do not necessarily reflect the opinions or official position of my employer.

View all
A reflection on Rob Rawson's presentation, How to Think with AI
Recapping the highlights from Datadog Summit London 2026. Explore cutting-edge monitoring innovations, observability best practices, and insights from industry leaders that are reshaping how teams manage cloud infrastructure and application performance.
Learn how to leverage the hash_agg function in Snowflake to efficiently compare tables and identify differences.

Work Experience

Professional experience in data engineering and software development.

Filter by:
AI
Backend
Cloud
Data Engineering
Frontend
Software Engineering
Data Engineer II
CRU Group InternationalLondon, United KingdomHybrid
Sep 2025 – Present7 months

Building scalable data infrastructure and ETL pipelines for global commodity market intelligence, processing massive datasets to deliver actionable insights for strategic decision-making.

Python
git
Apache Airflow
DBT
Snowflake
AI
Data Engineering
Docker
Sigma
Terraform
Rudderstack
NocoDB
Software Engineer I
Suade LabsLondon, United KingdomHybrid
Sep 2023 – Sep 20252 years

Engineered and maintained software solutions for regulatory risk reporting in the financial sector, with a focus on ETL pipelines to ensure compliance, accuracy, and reliability for major banks.

Python
Linux
git
Clickhouse
Dagster
Docker
FastAPI
PostgreSQL
CI/CD
Bash Scripts