Profile
Senior data engineer and Databricks developer with 7+ years designing, building, and maintaining scalable data pipelines on Azure, GCP, Informatica, TIBCO EBX, and SnapLogic. Strong background in Apache Spark, Databricks, SQL, Python, FastAPI, Spring, and React, modernizing batch ETL and analytical workflows into cloud-native architectures. Proven work in data quality, governance, software architecture, ETL orchestration, and collaboration with analytics and backend teams.
Experience
Advanced Data Engineer & Senior Quality Engineer
- Designed and implemented a configuration-first automation framework for functional testing of ML-driven recommendation flows.
- Integrated Spring-based applications with ReportPortal (metrics/traceability) and QMetry (quality engineering tracking).
- Led data quality, validation, and governance strategies for enterprise data platforms.
- Built performance-testing automation (sudden spike, stochastic TPS noise, endurance, high-TPS scenarios) and improved QMA for existing solutions at scale.
Senior Quality Engineer
Faculty Lecturer (part-time)
- Data structures and algorithms 2 — industry-oriented course (Spanish).
- Modern data mining patterns for computing engineers (English).
Senior MDM Data Engineer
- Enterprise pipelines with IDMC, IICS, PowerCenter, and Databricks for batch and near-real-time processing.
- Java Spring REST APIs (layered, domain-driven) integrating internal apps with on-prem MDM; reduced data consumption bottlenecks by roughly 30%.
- MDM architecture for customer domains: CTE-heavy SQL, procedures, modeling; support for analytics and insurance/risk reporting.
- Production support and UAT; FastAPI integration with Power BI for ETL job traceability; OpenAPI/Swagger for API schemas.
MDM Data Engineer
- Databricks pipelines ingesting SAS, relational DBs, and APIs; Python ETL and SQL optimization.
- Workflow integration with Azure (monitoring, CI/CD, storage); Power BI on harmonized datasets.
MDM Designer & Software Designer (consultant)
- Enterprise integration architectures across SAS, Databricks, TIBCO EBX, and SnapLogic.
- Distributed J2EE workload distribution with governance-as-code, CI/CD, and modular subsystems matured through agile delivery.
- Automated batch, reconciliation, and DQ validation for very large datasets (e.g. ~250M rows) to meet SLOs.
- Extended MDM via Java APIs and REST; FastAPI backends with NoSQL integrating third parties (e.g. Google Maps, OpenAI SDK).
Software Development Engineer
- Test automation and backend development with Spring Boot and .NET Core; DDD and data-driven design for two enterprise applications.
MDM & Statistics (Actuarial)
- Analytical pipelines and BI over SAS actuarial data; automated SQL and SnapLogic ETL.
Technical Lead & Founder
- Data-driven educational platforms (Python, FastAPI, Manim, Java); CI/CD with GitHub Actions.
Education
MSc in Computer Science (PCIC)
Focus on solution architectures and tradeoffs, modern software design, and integration with AI operations.
BSc in Physics
Scientific programming, simulation, and data analysis.
BSc in Mathematics
Selected projects
Professional and research work complementing the public repositories listed below.
Cattle accounting system
ETL and ML on transactional and historical data to estimate cattle counts from drone video; Spark aggregations for reporting.
Connecting (POS platform)
Centralized POS for small businesses using cloud services; monolith plus real-time messaging (WebSockets), token-based auth.
Urban expansion modeling
AC–GA pipeline with unsupervised training to forecast urban land use (~80% accuracy) for planning. Related repository: CityModelling (thesis work).
Public GitHub repositories
All public repositories under NoSoyRo (name, primary language, notes).
| Repository | Language | Notes |
|---|---|---|
.NET-Projects | Jupyter Notebook | .NET coursework and exercises. |
ARTIFICIAL-INTELLIGENCE-PROJECTS | Jupyter Notebook | Portfolio work as a data analyst. |
ArtificialIntelligence | Python | |
Basic-Problems | Java | Practice across languages and paradigms. |
CalcManim fork | — | Multivariable calculus animations (Manim). |
CityModelling | Jupyter Notebook | Master's thesis project. |
CloudAuthentication | Java | JPA and Spring Security sample (web app). |
DataSetCreation | Java | Random dataset generation for MDM scenarios. |
DataStructures | Java | Data structures in Java. |
DataStructuresC- | C# | Data structures course (C#). |
Django | Python | |
django-crud-example fork | — | Django CRUD (function-based views). |
Django-Projects | Python | |
eCommmerce | Java | |
etl-mdm | Java | MDM inner stages with Spring Batch. |
fractalcruncher_v1n fork | — | Iterated functions and visualization. |
Lab_contemporanea_2 | Jupyter Notebook | |
LeetCode | Python | Practice solutions. |
mdm-framework | Jupyter Notebook | MDM-related experiments and notebooks. |
NoSoyRo-pre-pro fork | Jupyter Notebook | |
NoSoyRo-seg_aplic_2026-1 fork | — | Course: application security fundamentals. |
nosoyro.github.io | JavaScript | This site. |
Physics | — | Simulation workflows. |
PI-SOLUTIONS | JavaScript | Pi Solutions projects. |
ProgramacionAvanzada | Java | MSc advanced programming coursework. |
proyecto-pa | Java | |
Proyectos_Personales | Jupyter Notebook | Homework scripts and personal experiments. |
Security | C# | ASP.NET Identity exploration. |
TheGame | — | Dissemination, coursework, and side projects. |
Earlier visual demos (calculus sums, Fourier, collisions, n-body, and more) live under TheGame and Proyectos_Personales.
Skills & certifications
- Programming
- Python (5+ years), Java (Spring Boot/Cloud), SQL (SQL Server), .NET Core
- Data engineering
- Databricks, Apache Spark, ETL, data quality, governance, IICS, PowerCenter, MDM, SnapLogic, TIBCO EBX, Spring Batch, SAS-style modernization
- Software engineering
- Spring ecosystem, FastAPI, Django, React, Node.js, JBoss, Tomcat, NGINX, CI/CD, DDD, microservices, design patterns
- Cloud & DevOps
- Azure, GCP, Git / GitHub / GitLab
- Analytics
- Power BI, OLAP, feature engineering
- Languages
- Spanish (native), English (full professional)
Certifications
- Informatica MDM Developer — Aug 2023
- SnapLogic Professional Developer — Jun 2022
- SnapLogic Administrator / Architect — Oct 2022
- Microsoft Azure Fundamentals — Dec 2023