Hi, I'm Stephanie Choi

Python Software & Data Engineer 👩🏻‍💻

I build scalable data pipelines and backend services. I'm passionate about writing clean, maintainable code, automating processes, and helping clients modernize their tech stack with efficient, future-ready solutions.

Python, Snowflake, DBT, Apache Airflow, PostgreSQL, Docker, AWS, Dagster, Polars, ESP32, MQTT

Featured Projects

A collection of personal projects showcasing my work in data engineering and software development.

Data Pipeline Orchestrator (DBT+Airflow+Snowflake)
Scalable ETL pipeline for processing large datasets with DBT, Airflow, and Snowflake, including data quality checks and monitoring.
Python, DBT, Apache Airflow, SQL, AWS S3, Cosmos, Snowflake
Data Pipeline Orchestrator (Dagster)
Scalable ETL pipeline for processing large datasets with Dagster, including data quality checks and monitoring.
Python, Dagster, SQL, Docker, AWS S3, Polars
Log Analysis Tool
CLI tool for parsing and analyzing application logs. Features pattern matching, anomaly detection, and automated report generation.
Python, Regex, Bash Scripts
Taiwan Stock Exchange Data
Extracts daily data from the Taiwan Stock Exchange (TWSE) and pre-processes it before ingesting it into a database.
Python, PostgreSQL, RabbitMQ, Celery, Pydantic, Matplotlib, FastAPI, DigitalOcean, Pandas
Trading Journal Application
Day-trading journal platform where users can log, track, and analyze trade performance through a visual dashboard built with Plotly Dash, plus a feature for tagging trade comments.
Python, Plotly Dash, SQL, Django

Latest Blog Posts

Recent thoughts and technical articles

Keeping package versions up to date across projects can be tedious. This post shows how a GitHub Action can automate the process, saving time and reducing manual effort.
Automate tedious variable or function renaming across multiple repositories with a simple script. Save time, reduce errors, and streamline your workflow when updating code in several projects at once.
Key takeaways and study strategies from my journey to passing the Snowflake Core Certificate, including recommended resources, hands-on practice tips, and lessons on mastering core concepts.

Work Experience

Professional experience in data engineering and software development.

Data Engineer
CRU Group International, London, United Kingdom (Hybrid)
Sep 2025 – Present (4 months)

Building scalable data infrastructure and ETL pipelines for global commodity market intelligence, processing massive datasets to deliver actionable insights for strategic decision-making.

Python
Git
Apache Airflow
DBT
Snowflake
AI
Data Engineering
Docker
Sigma
Terraform
Rudderstack
NocoDB
Software Engineer I
Suade Labs, London, United Kingdom (Hybrid)
Sep 2023 – Sep 2025 (2 years)

Engineered and maintained software solutions for regulatory risk reporting in the financial sector, with a focus on ETL pipelines to ensure compliance, accuracy, and reliability for major banks.

Python
Linux
Git
ClickHouse
Dagster
Docker
FastAPI
PostgreSQL
CI/CD
Bash Scripts