// AVAILABLE FOR INTERNSHIPS · 2026
Victor Kipruto Rop
Data Engineer · Pipeline Architect · Data Architect
Building production-grade data systems that scale — from Kafka streams to Airflow orchestration. Experienced in designing reliable ETL, low-latency streaming, and warehouse modelling. Based in Nairobi, Kenya.
// 02
About Me
I'm a final-year BSc Data Science student at The Cooperative University of Kenya, building a career at the intersection of scalable systems and trustworthy data. My work is driven by a belief that clean, well-engineered data infrastructure is what separates organisations that can act on their data from those that only think they can.
Outside of building pipelines, I'm drawn to the growing data ecosystem across East Africa and the opportunity to work on infrastructure that has real operational impact — whether in aviation safety, energy, or financial services.
// 03
Learning & Project Experience
Final-Year BSc Data Science Student
The Cooperative University of Kenya
2022 - Present
Over the past year, I've specialized in data engineering and have built production-grade data systems through hands-on projects. Designed and implemented scalable ETL pipelines, real-time stream processing with Kafka, and warehouse infrastructure. These projects have equipped me with deep technical skills in Python, SQL, Airflow orchestration, and cloud architecture.
Hands-On Project Development
Building Real-World Data Infrastructure
Last 12 Months
Completed 5 major projects involving data pipeline design, cloud ETL systems, real-time transaction streaming, and warehouse optimization. Each project has significantly strengthened my understanding of production systems, best practices, and scalable architecture. I'm committed to continuous learning, eager to gain from experienced professionals, and passionate about building reliable data systems that drive organizational value.
// 04
Technical Skills
// 05
Professional Certifications
Big Data Foundations
IBM / Cognitive Class
2026
Comprehensive training in big data concepts, Hadoop ecosystem, and distributed computing fundamentals.
Enterprise Design Thinking
IBM
2026
Mastered user-centered design principles and collaboration techniques for solving complex business problems.
Professional Development
Verified Credential
2026
Continued learning and specialization in scalable data systems and engineering best practices.
// 06
Projects (5)
Data Engineering — Full Project
Designed and automated a complete star schema architecture with cloud-based data ingestion and optimised analytical queries for BI reporting.
// 06
Technical Blog
Welcome to My Technical Blog
An introduction to my technical blog where I share insights on data engineering, ETL pipelines, and real-time systems.
Read ArticleData Engineering Fundamentals: Building Scalable Systems
A comprehensive guide to data engineering fundamentals, covering architecture principles, ETL design patterns, and production-grade best practices.
Read ArticleAdvanced Airflow Patterns & Optimization
Deep dive into dynamic DAG generation, custom operators, monitoring strategies, and scaling Airflow to production workloads.
Read Article// 07
Testimonials & Feedback
"Victor's ability to design scalable data pipelines is exceptional. His Airflow implementations are production-ready and well-documented. A great addition to any data team."
Data Engineering Mentor
Project Lead • The Cooperative University
"Demonstrates exceptional understanding of cloud infrastructure, CI/CD pipelines, and distributed systems. Eager to learn and collaborate effectively with the team."
DevOps & Infrastructure Review
Technical Assessment • GitHub Projects
"Proficient in ETL architectures, real-time streaming with Kafka, and warehouse modelling. Shows strong fundamentals and brings fresh perspectives to complex problems."
Data Architecture Review
Technical Feedback • Project Collaboration
// 08
How I Work
Pipeline Design
I design resilient ETL/ELT frameworks. By treating data pipelines as software, I ensure modularity, observability, and scalability from the very first row.
Orchestration
I leverage Apache Airflow to turn complex, interdependent data tasks into deterministic, automated workflows with robust alerting and retry strategies.
Data Modelling
I bridge raw data to business value using dimensional modelling (Star/Snowflake), optimized for high-performance analytical query engines.
Data CI/CD
I treat data infrastructure with engineering rigor, implementing automated unit testing (Pytest) and deployment pipelines for constant reliability.
// 09
Talks & Contributions
Open Source Contribution
Improved error-handling documentation and resolved critical race conditions in a widely used Python data processing library.
View All Contributions →Tech Talk
Delivered a technical session on "Designing Fault-Tolerant Data Pipelines with Airflow" to 100+ students at a university tech meetup.
View All Talks →// 07
GitHub Activity
@Victor-Kipruto-RopLoading recent activity…
Top Repositories
Loading repositories…
// 07
Get in Touch
Schedule a Call
Want to discuss data engineering, project collaboration, or just chat about pipelines? Book a time that works for you.
Schedule on Calendly →Send a Message
Prefer email? Send me a message and I'll get back to you within 24 hours.
Powered by Formspree