hi, my name is

Peter Caleb Ayertey.

I solve expensive data problems at scale.

I've migrated 100+ legacy ETL pipelines to AWS and Azure, cutting runtime by 50% and manual work by 90%. I've built fraud detection infrastructure processing $2B+ in daily transactions, deployed Delta Lakehouses for financial services, and architected real-time event-driven systems across three continents. I turn chaotic data into reliable infrastructure: fast, automated, and boring.

01. Experience

Data Engineer @ Bosonit

Nov 2025 - Present | Logroño, Spain (Remote)

  • Built API-driven ingestion services to extract and normalize data from external systems into analytics-ready models
  • Architected S3-based data lakes on AWS, providing scalable storage and ingestion workflows for downstream analytics
  • Developed Docker-based test environments for API connectors, reducing deployment issues by 40%
  • Implemented database version control with Alembic schema migrations to keep environments consistent
  • Built Grafana dashboards to monitor connector health, pipeline performance, and data quality metrics

02. Impact

$2B+
Daily transactions processed via Hawk-ai fraud detection workflows at Ecobank Group
90%
Reduction in manual effort through AWS event-driven pipeline architecture
50%
Runtime improvement migrating 100+ SSIS pipelines to Azure Databricks

03. Tech Stack

Languages

Python
SQL
Bash

AWS

Redshift
Glue
Lambda
S3
MWAA

Azure

Databricks
ADF
Synapse
Data Lake

Tools

Apache Airflow
Spark
Kafka
Docker
Terraform

04. Featured Projects

Real-Time Ride-Hailing Pipeline

view source →

Architected a streaming pipeline that moves data through AWS ECS and Lambda into Redshift, orchestrated with Airflow on MWAA using custom S3 sensors. Built QuickSight dashboards for real-time revenue and density analytics.

AWS Lambda Redshift Airflow QuickSight

Automated POS Data Validation

view source →

Developed a serverless batch validation system with AWS Glue and Lambda. Designed DynamoDB-backed dead letter queue logic for rejected rows, reducing manual cleanup and enforcing schema integrity at scale.

AWS Glue Lambda DynamoDB Python
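
The dead letter idea can be illustrated with a minimal sketch. It is not taken from the project source: the row schema, the function names, and the in-memory `dead_letters` list (standing in for a DynamoDB table) are all assumptions made for illustration.

```python
from dataclasses import dataclass, field

# Hypothetical POS schema: column name -> required Python type.
POS_SCHEMA = {"store_id": str, "sku": str, "quantity": int, "unit_price": float}


@dataclass
class ValidationResult:
    valid: list = field(default_factory=list)
    # Stand-in for a DynamoDB table that stores rejected rows.
    dead_letters: list = field(default_factory=list)


def validate_row(row: dict) -> list:
    """Return the schema violations found in a single row."""
    errors = []
    for column, expected in POS_SCHEMA.items():
        if column not in row:
            errors.append(f"missing column: {column}")
            continue
        value = row[column]
        # bool is a subclass of int, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, expected):
            errors.append(f"{column}: expected {expected.__name__}")
    return errors


def validate_batch(rows: list) -> ValidationResult:
    """Split a batch into valid rows and dead-lettered rows with reasons."""
    result = ValidationResult()
    for index, row in enumerate(rows):
        errors = validate_row(row)
        if errors:
            # Keep the original row and the reasons so it can be fixed and replayed.
            result.dead_letters.append(
                {"row_index": index, "row": row, "errors": errors}
            )
        else:
            result.valid.append(row)
    return result
```

Storing the rejection reasons next to the original row is what lets a rejected record be inspected and replayed later instead of being cleaned up by hand.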

IoT Heartbeat Monitor System

view source · prod_lab5 →

Built a distributed monitoring system on Apache Kafka. Containerized the full stack (ZooKeeper, PostgreSQL) with Docker Compose. Production logic is maintained on the prod_lab5 branch, which demonstrates deployment strategies.

Apache Kafka Docker PostgreSQL ZooKeeper
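
As a rough sketch of the core heartbeat check, assuming events have already been consumed from Kafka and reduced to a device ID and a timestamp: the class name, the 30-second timeout, and the method names below are hypothetical and not taken from the repository.

```python
from dataclasses import dataclass, field


@dataclass
class HeartbeatMonitor:
    # Hypothetical threshold: a device is stale after this many silent seconds.
    timeout_seconds: float = 30.0
    last_seen: dict = field(default_factory=dict)

    def record(self, device_id: str, timestamp: float) -> None:
        """Store the newest heartbeat per device, ignoring out-of-order events."""
        previous = self.last_seen.get(device_id)
        if previous is None or timestamp > previous:
            self.last_seen[device_id] = timestamp

    def stale_devices(self, now: float) -> list:
        """Return device IDs whose last heartbeat is older than the timeout."""
        return sorted(
            device_id
            for device_id, seen in self.last_seen.items()
            if now - seen > self.timeout_seconds
        )
```

Keeping only the latest timestamp per device makes the check tolerant of late or duplicated messages, which a Kafka consumer can deliver after a rebalance or retry.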

05. Certifications

06. What's Next?

Get In Touch

I'm currently open to new opportunities where I can architect data infrastructure at scale. Whether you're building the next fintech platform or modernizing legacy systems, let's talk.

Say Hello