ML & Data Engineer

AI Research • Industrial AI • Data Engineering

Manuel Caipo
Manuel Caipo

Machine Learning Engineer

About Me

Machine Learning Engineer with a background in Mechanical Engineering, working at the intersection of artificial intelligence, data engineering, and complex engineering systems.

I develop machine learning systems and data infrastructures for industrial cyber-physical environments, combining physical modeling, industrial domain knowledge, and scalable data pipelines to build reliable and interpretable models for real-world engineering applications.


Industry

During my work in the mining industry at Freeport-McMoRan, following a highly selective recruitment process (100 selected from over 16,000 applicants), I developed machine learning systems for predictive maintenance of large-scale equipment, improving asset availability and enabling data-driven operational decision-making.

Currently at Bosch Rexroth, I work on AI-based condition monitoring for hydraulic systems, designing industrial data pipelines and real-time telemetry architectures that integrate sensor streams, messaging systems, and machine learning models for intelligent monitoring and decision support.

Research

My research focuses on structured machine learning for engineered systems, including:

  • graph-based representations of industrial systems
  • physics-aware learning methods
  • machine learning models that incorporate system architecture and physical constraints

More broadly, I am interested in how machine learning and data infrastructures can augment engineering design, operation, and maintenance, enabling intelligent systems that reason about complex machines and improve their performance over time.


Vision

My long-term goal is to pursue a Ph.D. in Artificial Intelligence and Machine Learning, with a focus on computational methods for the analysis, modeling, and design of complex engineering systems.

My research interests lie in learning-based approaches that integrate machine learning with structured representations of engineering systems and physical knowledge. I am especially interested in developing intelligent computational tools that support the analysis, design, and operation of advanced engineered systems, enabling more systematic and data-informed engineering processes.


Research Keywords

AI for Engineering Design Intelligent Diagnosis Physics-informed ML Graph Learning for Cyber-Physical Systems Industrial AI Predictive Maintenance


Focus Areas

Machine Learning for Engineering Systems

Technologies

  • Python (NumPy, Pandas, SciPy, PyTorch)
  • Time-series modeling
  • Graph-based learning approaches
  • System identification
  • Physics-informed machine learning

Development of machine learning methods that integrate engineering knowledge and physical constraints to improve monitoring, reliability, and decision-making in complex industrial systems.

Physics-Aware and Structured Machine Learning

Technologies

  • Graph Neural Networks
  • Causal modeling approaches
  • Physics-informed neural networks
  • Explainable AI (SHAP, PDP)

Research on learning approaches that incorporate system structure, causal relations, and physical constraints into machine learning models.

Industrial AI and Predictive Maintenance

Technologies

  • LSTM / CNN architectures
  • Survival models for RUL prediction
  • Feature engineering for industrial sensor data
  • Explainable AI

Design of ML systems for anomaly detection, degradation modeling, and remaining useful life (RUL) prediction using high-frequency sensor data from large-scale industrial assets.

Scalable Data and ML Systems for Industry

Technologies

  • Docker-based architectures
  • Dagster / Apache Airflow pipelines
  • SQL / NoSQL systems (PostgreSQL, InfluxDB, Snowflake)
  • Streaming technologies (Kafka, Solace, MQTT)
  • Cloud ML systems (Azure ML, Databricks)

Development of data pipelines, telemetry systems, and deployable machine learning architectures for cyber-physical systems.

Analytics

Download CV
Interests
  • Machine Learning for Engineering Systems
  • Physics-informed Machine Learning
  • Industrial AI and Predictive Maintenance
  • Cyber-Physical Systems
Education
  • M.Sc. Computational Science and Engineering

    University Ulm

  • M.Sc. in Advanced Precision Engineering

    Hochschule Furtwangen

  • Diplom in Machine and deep Learning

    Universidad catolica san pablo

  • B.Sc. in Mechanical Engineering

    Universidad Nacional de San Agustin de Arequipa

Professional Journey
2024–Present | Data Scientist Werkstudent | Masterand – Bosch Rexroth (Ulm, Germany)
  • Industrial ETL Pipeline: Designed and implemented a complete ETL process to transmit CtrlX sensor data to a relational database via Solace messaging, fully containerized in Docker
  • Data Processing Optimization: Developed and optimized stored procedures for restructuring raw sensor streams into machine-learning-ready formats
  • RUL Prediction & Orchestration: Built predictive models for Remaining Useful Life (RUL) of hydraulic systems using Hidden Markov Models, clustering, and XGBoost — orchestrated via Dagster
2021–2023 | Data Science – Freeport-McMoRan (Global Mining Operations)

Presidential Award — July 2022
Honored with the President’s Award and Innova 2022 (1st Place Digital Transformation) for predictive wear models improving plant availability by +1.5%.

  • Global ML Deployment: Azure ML pipelines for 200+ heavy assets
  • Data Infrastructure Optimization: SQL preparation time reduced from 8h → 22min
  • Adaptive Learning Systems: Continuous retraining on multivariate operational data
  • Decision Intelligence: Power BI dashboards for maintenance optimization
2015–2021 | Engineering Foundations (Peru)
  • IMCO Servicios: FEM and CFD simulations for mining components
  • Academic Excellence: Top 1% Mechanical Engineering (UNSA)
Academic Milestones
  • M.Sc. Computational Science and Engineering — University of Ulm
  • M.Sc. Advanced Precision Engineering — Hochschule Furtwangen
  • Postgraduate Diploma — Machine Learning & Deep Learning — UCSP
  • B.Sc. Mechanical Engineering — UNSA
Featured Publications

Experience

  1. Data Scientist – Werkstudent

    Bosch Rexroth
    • Designed and implemented an ETL pipeline in Docker for processing and storing industrial data in a data lake and relational database.
    • Integrated Solace as a message broker to optimize industrial data flow.
    • Developed a monitoring system for hydraulic systems to detect anomalies and analyze operating cycles.
  2. Junior Data Science Analyst 2

    Freeport-McMoRan – Cerro Verde
    • Scaled predictive wear models across global mining sites using Azure ML Jobs for automated predictions.
    • Optimized data preprocessing using SQL stored procedures for efficient ML training.
    • Deployed retraining automation processes to continuously update models with historical machine data.
  3. Junior Data Science Analyst 1

    Freeport-McMoRan – Cerro Verde
    • Deployed ML models for daily wear prediction of primary crushers and cyclone pumps using Azure ML.
    • Automated reporting systems to compare current machine states with historical performance data.
  4. Junior Data Analyst 1 – Reliability Engineering

    Freeport-McMoRan – Cerro Verde
    • Developed machine anomaly strategies based on historical failure analysis.
    • Created Power BI dashboards and reports for monitoring and decision-making.
    • Optimized SQL queries for faster data processing in maintenance analytics.
  5. Trainee Data Analyst – Reliability Engineering

    Freeport-McMoRan – Cerro Verde
    • Performed long-term analysis of equipment availability and operational patterns.
    • Focused on primary crushers, conveyors, cyclone pumps, ball mills, and HPGR systems.
  6. Junior Engineer – Technical and Development

    IMCO Servicios S.A.C
    • Created structural calculation reports using simulation software like SAP2000, Inventor, Ansys Structural - Fluent and AutoCAD 3D.
  7. Intern – Reliability Engineering

    Freeport-McMoRan – Cerro Verde
    • Optimized production processes using multiphysics CFD simulations with tools such as Ansys Structural, Fluent, Tekla, and Ametank.

Education

  1. M.Sc. Computational Science and Engineering

    University Ulm
  2. M.Sc. in Advanced Precision Engineering

    Hochschule Furtwangen
  3. Diplom in Machine and deep Learning

    Universidad catolica san pablo
    GPA: 19.6/20.0
  4. B.Sc. in Mechanical Engineering

    Universidad Nacional de San Agustin de Arequipa

    GPA: 15.23/20.0

    National Scholarship “Beca Presidente del Peru” Winner, ranking the first in the department for five consecutive years.

Contact Me
E-mail
Connect