Senior Data Engineer
SeniorThailandFull-time
Asia/Bangkok — daily overlap with Warsaw 14:00–18:00 BKK
Architect and build QTPILOT's quantitative research data infrastructure from scratch — designing scalable systems that power research pipelines, model development, and live trading monitoring. This is not a database maintenance role.
What you'll do
- •Design and build a point-in-time research data lake that records the state of market data as it was known at each historical moment, eliminating look-ahead bias in research and backtesting.
- •Build version-controlled, reproducible feature engineering pipelines. Every model output must be traceable back to the data state and transformations that produced it.
- •Handle the realities of equity market data: outliers and anomalous prints, corporate actions (splits, dividends, mergers, spin-offs), and symbol mapping / ticker changes across vendors.
- •Tune the data retrieval layer for both research and production access patterns — columnar formats, partitioning, caching, and query performance tuning.
- •Monitor production data flow and quality end-to-end: real-time data flow from exchanges, data availability checks for model predictions, and continuous data quality monitoring in production.
What we're looking for
- •Bachelor's degree or higher in Computer Science, Engineering, Mathematics, or a related quantitative field.
- •Minimum 5 years in Data Engineering, with at least 2 years on production AWS infrastructure.
- •Track record of designing data systems from greenfield through to production.
- •Production-grade Python (not notebook-only).
- •AWS production experience: S3, Glue, Athena or EMR, Lake Formation, Lambda.
- •Workflow orchestration: Airflow, Dagster, or Prefect.
- •Time-series & columnar storage: Parquet, Arrow, ClickHouse, or equivalent.
- •Data lakehouse: Delta Lake, Iceberg, or Hudi.
- •Advanced SQL — both analytical and transactional databases.
- •Self-driven systems thinker; clear technical communicator who writes readable documentation.
Nice to have
- •Real-time streaming: Kafka, AWS Kinesis, or MSK.
- •Equity market data experience — tick data, OHLC, corporate actions, vendor reconciliation.
- •kdb+ or other specialized time-series database experience.
- •Experience inside a systematic trading team or hedge fund.
- •ML lifecycle support — feature stores, model registry, experiment tracking.
How to apply
Send your CV and a short cover letter — tell us about a project you're proud of and a result that surprised you. We read every application.
Apply for this Position