aPriori Technologies
Data Engineer November 2024 – March 2026, Belfast, Northern Ireland
Key Projects:
-
Source AI: ML-Powered Negotiation Assistant
- Replaced Lambda-based ingestion jobs with Airbyte (CDC), reducing data ingestion latency and eliminating key bottlenecks feeding ML workloads.
- Contributed to the architecture of Source AI, an ML-powered negotiation assistant leveraging LLMs for automated sourcing workflows.
-
GCP to AWS Cloud Migration
- Migrated the data platform from GCP to AWS, contributing to technology decisions and implementation to ensure feature parity while optimising for cost and performance.
- Designed and implemented ABAC IAM policies using resource tagging to enforce least-privilege access across cloud environments.
- Built custom dbt macros for JSON flattening and Redshift SUPER column handling, improving transformation reliability.
-
aPriori Data Mesh Platform
- Architected and developed a federated Data Mesh platform enabling domain-driven data ownership with automated metadata management and consistent governance across business units.
- Created
data-platform-cli, an internal Python CLI tool (Typer-based) that standardises Data Product scaffolding, deployment, and integration with dbt and Airflow. - Integrated Great Expectations into dbt pipelines for continuous data validation and contract testing, improving data quality SLAs.
Tools Used: Python, SQL, dbt, Airflow (MWAA), AWS (Redshift, S3, Lambda, ECS), GCP (BigQuery, Cloud Composer), Airbyte, Great Expectations, Docker
Civica
Data Scientist February 2022 – March 2024, Belfast, Northern Ireland
Key Projects:
-
Tourism Ireland: Dynamics 365 Reporting Solution
- Extracted and transformed customer engagement data from Dynamics 365 using Azure Synapse Analytics.
- Designed and developed interactive Power BI dashboards providing self-service analytics for marketing campaign performance.
- Collaborated with stakeholders to define KPIs and create strategic BI reports that informed €2M+ marketing budget allocation.
-
Northern Ireland Appeals Service: On-Premises Reporting Solution
- Refactored and optimised legacy SSAS tabular models, implementing data partitioning and query optimisation that reduced report load times.
- Developed advanced DAX measures and interactive Power BI reports enabling self-service analytics for case management.
- Managed SSAS and SSRS infrastructure, including automated refresh schedules and maintenance.
Tools Used: Azure Synapse Analytics, Power BI, T-SQL, DAX, SSAS, SSRS, Power Query, SQL Server, Dynamics 365
Sentireal
Data Scientist November 2020 – February 2022, Belfast, Northern Ireland
Key Projects:
- InterTradeIreland Co-Innovate: VR Simulation Training Platform
- Designed and implemented a serverless data extraction pipeline using AWS Lambda for cost-efficient API integration.
- Conducted exploratory data analysis and feature engineering using AWS SageMaker notebooks to identify key performance indicators.
- Developed and deployed an end-to-end ML pipeline with AWS SageMaker and CodePipeline for automated model training, evaluation, and deployment to API endpoints.
- Trained and optimised predictive models using Scikit-Learn and TensorFlow to assess VR training effectiveness.
Tools Used: Python, Scikit-Learn, TensorFlow, Docker, AWS (Lambda, SageMaker, Cognito, API Gateway, CodePipeline, CodeDeploy)
Skills & Competencies
- ML & Model Deployment: AWS SageMaker (notebooks, training, endpoint deployment), Scikit-Learn, TensorFlow, predictive modelling, feature engineering, ML pipeline CI/CD (CodePipeline/CodeDeploy)
- Languages & Frameworks: Python (Typer, Pydantic, FastAPI), SQL (PostgreSQL, Redshift, BigQuery, T-SQL), JavaScript/TypeScript, dbt Core, DAX, Power Query M, Bash
- Orchestration & Pipelines: Apache Airflow (MWAA, Cloud Composer), Airbyte CDC, AWS CodePipeline, DAG-based workflow design, retry logic, idempotent execution patterns
- Cloud & Infrastructure: AWS (S3, Redshift, Lambda, ECS, MWAA, SageMaker, Cognito, API Gateway, CodePipeline), GCP (BigQuery, Cloud Composer, Cloud Build), Azure (Synapse, Data Factory, SSAS, SSRS), Docker, Git, Jenkins
- Data & Governance: Data Mesh architecture, medallion architecture, Great Expectations, dbt, data contracts, ABAC IAM policies, federated governance, metadata management, data modelling (CDC, ETL/ELT)
- BI & Analytics: Power BI Desktop/Service, DAX, SSAS, SSRS, data visualisation, self-service reporting