Professional Summary
Results-driven Platform Development Engineer with over 15 years of IT infrastructure experience, including a 10+ year specialized track record in SRE and DevOps. I’m dedicated to building robust, resilient, and efficient platforms that power enterprise systems.
An expert in managing complex hybrid-cloud environments, I lead end-to-end migrations and implement automation that improves system reliability and delivers significant cost savings.
I excel at bringing teams together and working across departments to deliver on shared goals. I improve system architecture and reliability by championing meaningful SLAs and SLOs, backed by comprehensive observability frameworks. Combined with modern cloud-native architectures and containerization, this approach reduces downtime and increases developer velocity. I’m currently expanding my expertise into AI engineering.
Core Competencies
- Cloud Platforms: Microsoft Azure, AWS
- Automation & IaC: Ansible, Terraform, Git, Argo CD
- Containerization: OpenShift, Kubernetes, Docker
- Observability: Dynatrace, ELK, Prometheus, Grafana, Splunk, PagerDuty
- Operating Systems: Linux (RHEL, Debian), UNIX, Windows Server
- Scripting: Python, Bash, PowerShell
- Networking: TCP/IP, DNS, TLS, Firewalls, Routing
- Databases & Virtualization: MySQL, Oracle DB, VMware, Hyper-V
Professional Experience
Best Buy Canada | Vancouver, Canada
Platform Development Engineer – Reliability (SRE) (April 2019 – Present)
Platform Development Engineer – Performance (October 2018 – April 2019)
- Led the build-out of the Dynatrace observability platform and contributed to the operation of ELK, Prometheus, and Grafana to enhance system reliability.
- Developed Python scripts to extract maturity metrics from the Dynatrace API and forward them to a Prometheus Pushgateway deployed on OpenShift via Argo CD.
- Spearheaded the migration of the on-premises Dynatrace monitoring solution to Azure, ensuring a seamless transition and improved platform scalability.
- Deployed containerized applications into OpenShift using Argo CD for continuous delivery.
- Served as the Cloud Champion for the Reliability team, guiding cloud adoption initiatives and establishing best practices.
Key Project: Dynatrace SaaS Migration
- Led the migration of Dynatrace, a business-critical system, to SaaS, driving the initiative end to end from concept to completion.
- Initiated and managed vendor partnerships, leading technical and strategic discussions to align business, security, and operational requirements.
- Designed and implemented the self-hosted architecture and secured approvals from security, network, and other stakeholder teams.
- Planned and executed sprint delivery cycles, including backlog creation, story definition, prioritization, and progress tracking to ensure timely completion of all milestones.
- Delivered significant infrastructure cost savings and improved system resiliency through SaaS adoption and architectural modernization.
Key Project: SolarWinds Database Upgrade (SQL Server 2016 → 2022)
- Led the full migration lifecycle of the SolarWinds monitoring platform database from SQL Server 2016 to SQL Server 2022.
- Performed detailed compatibility assessments, backup and restore procedures, and disaster recovery validation.
- Executed platform upgrades and regression testing to confirm stability, performance, and feature compatibility in the new environment.
- Collaborated with infrastructure and application teams to coordinate change management, downtime scheduling, and post-migration verification.
- Achieved a cost-effective, modernized database platform with improved performance, security, and maintainability.
Vision Critical | Vancouver, Canada
Technical Operations Systems Administrator (November 2017 – October 2018)
- Provided comprehensive technical support for end-user systems, resolving hardware, software, and network issues to ensure business continuity.
- Administered a hybrid IT environment spanning Active Directory and Exchange Online, leveraging PowerShell to automate user administration and operational tasks.
- Managed and operated audio-visual equipment, including streaming setups and microphone configurations, for company-wide all-hands meetings and events.
Key Project: G Suite Deployment
- Served as technical lead for the end-to-end migration of over 300 users to G Suite.
- Engineered and implemented Active Directory integration to automate user provisioning and ensure seamless single sign-on (SSO) capabilities.
- Developed and delivered user training and documentation to facilitate a smooth transition and drive company-wide adoption of the new platform.
NHS Digital | Leeds, United Kingdom
Technical Consultant (August 2015 – October 2017)
- Engineered and implemented a new monitoring solution using LibreNMS, supplemented with custom shell scripts to provide comprehensive application oversight.
- Utilized Ansible to automate infrastructure security hardening, including the configuration of UFW/iptables.
Technical Operations Team Lead (February 2015 – August 2015)
- Led and mentored a technical operations team, delegating tasks and providing training to ensure the continuous operation of a national service supporting over one million users.
Technical Operations Technician (July 2014 – February 2015)
- Supported the deployment and technical operations for critical national systems, including NHS Spine2 and the NHS e-Referral Service.
- Leveraged Splunk for in-depth operational monitoring, including message tracing and application health analysis, to accelerate incident investigation.
Host Europe Group | Leeds, United Kingdom
Managed Services Hosting Technician (January 2014 – July 2014)
- Provided technical support for Linux systems, including Red Hat, CentOS, and Ubuntu Server.
- Administered and supported Windows Server environments up to Server 2012, including Hyper-V virtualization.
- Managed MySQL database administration, performing critical backups and restores.
- Configured, administered, and troubleshot DNS and email servers.
- Performed server updates and managed bespoke software installations on both Windows and Linux platforms.
Foundational Experience
- Linux System Administrator | Infoserve (2013)
- Unix Engineer (Contract) | British Telecom | Hewlett-Packard | IT Alliance (2011 – 2013)
- Helpdesk Technician | A4e (2009 – 2011)
- Assistant Network Analyst (Contract) | HSBC | Modis (2008 – 2009)
- Network / Site Engineer | ask4 Limited (2008)
- IT Technician (Work Placement) | JDR Cable Systems (2006 – 2007)
- Technical Support Adviser (Part-time) | Wanadoo (2005 – 2006)
Certifications & Education
- Microsoft Certified: Azure AI Engineer Associate (AI-102)
- AWS Certified Solutions Architect – Associate
- GitHub Foundations Certified
- ITIL v3 Foundation
- Microsoft Azure Administrator (AZ-103) – Expired
- BSc (Hons) in Computer and Network Engineering – Sheffield Hallam University