Hi, I'm Stephanie Choi

Python Software & Data Engineer 👩🏻‍💻

I build scalable data pipelines and backend services. Passionate about writing clean, maintainable code, automating processes, and helping clients modernize their tech stack with efficient, future-ready solutions.

PythonSnowflakeDBTApache AirflowPostgreSQLDockerAWSDagsterPolarsESP32MQTT

Featured Projects

A collection of personal projects showcasing my work in data engineering and software development.

Data Pipeline Orchestrator (DBT+Airflow+Snowflake)
Scalable ETL pipeline for processing large datasets utilising DBT+Airflow+Snowflake, includes data quality checks and monitoring.
PythonDBTApache AirflowSQLAWS S3CosmosSnowflake
Data Pipeline Orchestrator (Dagster)
Scalable ETL pipeline for processing large datasets from Dagster, includes data quality checks and monitoring.
PythonDagsterSQLDockerAWS S3Polars
Log Analysis Tool
CLI tool for parsing and analyzing application logs. Features pattern matching, anomaly detection, and automated report generation.
PythonRegexBash Scripts
Taiwan Stock Exchange Data
Extract daily information from TWSE and pre-process before ingesting the information to database.
PythonposgresqlRabbitMQCeleryPydanticmatplotlibfastAPIDigitalOceanPandas
Trading Journal Application
Day trading journal platform where users can log, track, analyze trade performance with visual dashboard using plotly Dash and tagging trade comments feature.
PythonPlotly DashSQLDjango

Latest Blog Posts

Recent thoughts and technical articles

View all
7 highly recommended newsletters/instagram influencer worth subscribing as a software developer or data engineer in 2025.
Some insights from snowflake world tour London 2025.
Heyo! I'm glad that you stumbled upon this site. Excited to share my journey in software and data engineering.

Work Experience

Professional experience in data engineering and software development.

Filter by:
AI
Backend
Cloud
Data Engineering
Frontend
Software Engineering
Data Engineer
CRU Group InternationalLondon, United KingdomHybrid
Sep 2025 – Present3 months

Building scalable data infrastructure and ETL pipelines for global commodity market intelligence, processing massive datasets to deliver actionable insights for strategic decision-making.

Python
git
Apache Airflow
DBT
Snowflake
AI
Data Engineering
Docker
Sigma
Terraform
Rudderstack
NocoDB
  • Improving onboarding efficiency and knowledge sharing across teams
  • Building data orchestration workflows using Airflow, enabling data quality checks and reliable end-to-end data pipelines
  • Leverage Snowflake for scalable data warehousing and analytics, optimizing query performance
  • Pursuing the SnowPro Core Certification (COF-C02) to deepen expertise in Snowflake architecture, data modeling, and performance optimization
Software Engineer I
Suade LabsLondon, United KingdomHybrid
Sep 2023 – Sep 20252 years

Engineered and maintained software solutions for regulatory risk reporting in the financial sector, with a focus on ETL pipelines to ensure compliance, accuracy, and reliability for major banks.

Python
Linux
git
Clickhouse
Dagster
Docker
FastAPI
PostgreSQL
CI/CD
Bash Scripts