Sr Data Engineer – Build GenAI Data Infrastructure – Data Engineer – Maher’s Project

LATAM

About Distillery

Distillery Tech Inc accelerates innovation through an unyielding approach to nearshore software development. The world’s most innovative technology teams choose distillery to help accelerate strategic innovation, fill a pressing technology gap, and hit mission-critical deadlines. We support essential applications, mobile apps, websites, and eCommerce platforms through the placement of senior, strategic technical leaders and by deploying fully managed technology teams that work intimately alongside our client’s in-house development teams. At Distillery Tech Inc, we’re not here to reinvent nearshore software development, we’re on a mission to perfect it. Distillery Tech Inc is committed to diversity and inclusion. We actively seek to cultivate a workforce that reflects the rich tapestry of perspectives, backgrounds, and experiences present in our society. Our recruitment efforts are dedicated to promoting equal opportunities for all ing for a skilled and motivated Software Engineer with a strong backgroucandidates, regardless of race, ethnicity, gender, sexual orientation, disability, age, or any other dimension of diversity.


About the Position

We are seeking for a Data Engineer with a passion for building scalable, cloud-native data pipelines to power cutting-edge GenAI products. The ideal candidate will bring deep expertise in modern data architectures, a strong command of Azure and Databricks, and the ability to collaborate with AI engineering teams to deliver innovative, high-impact solutions. This is a unique opportunity to help shape data systems that enable real-time retrieval-augmented generation (RAG), embeddings workflows, and GenAI applications working across structured and unstructured datasets.


Responsibilities

  • Build, deploy, and maintain scalable data pipelines on Azure.
  • Design and optimize data schemas and architectures tailored for machine learning and GenAI workloads.
  • Develop and orchestrate ETL/ELT workflows using Databricks, Spark, and Airflow.
  • Ensure data quality, observability, and governance using modern validation and monitoring tools.
  • Collaborate closely with AI engineering teams to enable retrieval workflows, prompt chaining, and embedding storage.
  • Participate in sprint planning, code reviews, and technical design discussions in a collaborative environment.
  • Contribute to high-impact projects that drive innovation at the forefront of GenAI.


Requirements

  • Bachelor’s degree in Computer Science, Engineering, or a related field.
  • 8+ years of professional experience in Data Engineering roles.
  • Strong expertise in Azure, Python, Databricks, Airflow, Spark, SQL, and Docker.
  • Experience designing scalable data pipelines and optimizing architectures for machine learning or GenAI use cases.
  • Familiarity with structured and unstructured data handling, data governance, and observability best practices.
  • Ability to work fluently in English, essential for cross-team collaboration.
  • A passion for solving complex data challenges in a fast-paced, innovation-driven environment.


Nice to Have

  • Experience with LangChain, LLMs, or GenAI architectures.
  • Exposure to vectorstore technologies such as FAISS, Chroma, or Pinecone.
  • Experience working on 0-to-1 projects or building scalable solutions from the ground up.


Why You’ll Like Working Here

  • Collaborate with multi-national teams committed to our core values: Unyielding Commitment, Relentless Pursuit, Courageous Ambition, and Authentic Connection.
  • Enjoy a competitive compensation package, generous vacation, and comprehensive benefits.
  • Work remotely in a flexible, supportive environment.
  • Access professional and personal development opportunities to advance your career.