Skip to content
Available for new projects
Senior Data Engineer

Build data systems that hold up in production

AI-powered cloud pipelines and LLM integration for enterprise data teams.

Book Intro Call View Case Studies

Portrait of Andres Avila, Senior Data Engineer

$750K Costs Reduced
2,000+ Hours Automated
67% Pipeline Speedup
5 Cloud Certifications

About me

Hi, I'm Andres — a Senior Data Engineer based in México, with 5+ years building cloud data systems for companies including Hershey's, EY/Microsoft, Wizeline, and 8am.

I focus on AI-powered data systems: combining cloud pipeline engineering (Azure, AWS, Snowflake, dbt) with LLM integration to help teams turn raw data into reporting and analytics they can rely on. My work spans the full data lifecycle — from architecture and ingestion to transformation, orchestration, and business-facing analytics.

I'm currently open to Backend Contracts — long-term engagements with teams that need a senior data engineer without the overhead of a full-time hire.

What I can help you with

  • Cloud Data Engineering


    I design and build production pipelines on Azure, AWS, and Snowflake — from raw ingestion to semantic modeling with dbt. Clean, observable, maintainable infrastructure.

  • LLM Integration


    I connect structured data to language models to automate analysis, classification, and reporting — with a live product, biopanel.io, behind it rather than just demos.

  • Analytics Engineering


    Semantic layers, metric definitions, and dashboards your team can trust. I've built reporting infrastructure used by analytics and data science teams at large enterprises.

  • Data Migration


    I've led client-specific migration workflows with full validation pipelines: extraction, transformation, profiling, defect detection, and deployment — with documentation your team can maintain.

Why work with me?

  • Certified across all major clouds


    Five active cloud data certifications — credentials behind the work, not a substitute for it. AWS DEA-C01 Databricks Pro SnowPro Core Azure DP-203 Fabric DP-700

  • Concrete, measurable outcomes


    $750K in vendor costs removed and 2,000+ hours automated at Hershey's; a 67% runtime reduction at EY/Microsoft; a 32% pipeline speedup at Wizeline. I work toward outcomes I can point to.

  • Full-stack when it matters


    I built biopanel.io end-to-end: FastAPI backend, Celery + Redis async pipeline, PostgreSQL, React frontend. I can talk to your product and engineering teams in the same language.

  • Remote-first, clear communication


    I've worked remotely across EY, Microsoft, Hershey's, Wizeline, and 8am. Clear communication, regular updates, and documentation — you always know where things stand.

Tech stack

Cloud

  • Azure (ADF, Synapse, Fabric, Functions)
  • AWS (S3, Glue, Redshift, Lambda)
  • Snowflake
  • GCP basics

Orchestration & Transformation

  • Apache Airflow
  • dbt
  • PySpark
  • Azure Data Factory

AI / LLM

  • OpenAI API
  • LangChain
  • Pinecone
  • Retrieval-Augmented Generation
  • Streamlit

Languages

  • Python
  • SQL
  • DAX
  • Shell/Bash
  • YAML
  • TypeScript (basics)

DevOps

  • Azure DevOps
  • GitHub Actions
  • Docker
  • Terraform
  • CI/CD pipelines

Frequently asked questions

What kind of contracts are you available for?

I'm primarily open to Backend Contracts (BECs) — long-term staff augmentation engagements (typically 20–32 hrs/week) where I embed as a senior data engineer in your team. I'm also open to well-scoped project-based contracts for data migrations, pipeline buildouts, or LLM integration work. Minimum engagement: 20 hours.

What industries have you worked in?

Consumer goods (Hershey's), professional services and enterprise software (EY/Microsoft), technology (Wizeline), and health tech (biopanel.io — personal project). I'm most effective with enterprise teams that have complex data at scale — fintechs, SaaS platforms, and healthcare are strong fits.

How do you handle confidentiality?

I sign NDAs before any project begins. All credentials and sensitive data are managed with enterprise-grade security practices — never hardcoded, always encrypted at rest and in transit. I can work within your existing security policies.

What's your typical engagement structure?

I prefer a short paid discovery phase (5–10 hours) to fully understand your architecture and data before committing to a full engagement. This eliminates surprises on both sides. From there, I work in weekly cycles with async updates and a standing check-in.

Where are you based? Do you work with international clients?

I'm based in Aguascalientes, México (UTC-6). I work remote-first and have experience with distributed teams across US, EU, and LATAM timezones. Fluent in English and Spanish.

What are your rates?

I price based on scope and value delivered, not hours logged. Let's talk during our intro call — I'll be direct about what makes sense for your situation.

  • Let's see if we're a fit


    I take on a limited number of clients at a time to ensure quality. Book a free 30-minute call to discuss your data challenges and what an engagement could look like.

    Book Intro Call