Sr Data Engineer/Analyst (Azure & Databricks)


About Distillery 

Distillery accelerates innovation through an unyielding approach to nearshore software development. The world’s most innovative technology teams choose Distillery to help accelerate strategic innovation, fill a pressing technology gap, and hit mission-critical deadlines. We support essential applications, mobile apps, websites, and eCommerce platforms by placing senior, strategic technical leaders and deploying fully managed technology teams that work closely alongside our clients’ in-house development teams. At Distillery, we’re not here to reinvent nearshore software development; we’re on a mission to perfect it.

Distillery is committed to diversity and inclusion. We actively seek to cultivate a workforce that reflects the rich tapestry of perspectives, backgrounds, and experiences present in our society. Our recruitment efforts are dedicated to promoting equal opportunities for all candidates, regardless of race, ethnicity, gender, sexual orientation, disability, age, or any other dimension of diversity.


About the position

We're seeking a Senior Data Analyst/Engineer to lead the migration of business logic from Power BI and Excel into a governed Databricks Lakehouse. You will be embedded within the client's data team as the primary driver of that work. You will work directly with business users to understand and reverse-engineer their existing reports, re-model and rebuild that logic in Databricks with performance and scalability in mind, validate the results, document definitions, and progressively transfer knowledge to the internal team.


Responsibilities

  • Reverse-engineer logic embedded in Power BI (DAX, semantic model relationships, Power Query) and Excel-based workflows, tracing it back to silver and source layers.
  • Design and build clean, well-modeled silver and gold/reporting layers in Databricks — applying dimensional modeling principles, medallion architecture, and coding best practices — so the visualization layer consumes small, fast, pre-validated datasets.
  • Partner with business SMEs (data team, finance, and other functions) to understand existing reporting processes and the metrics that must be exactly correct.
  • Develop and optimize incremental pipelines, adding validation and reconciliation consistent with the team's existing practices.
  • Build and maintain a data dictionary and semantic layer to support self-service and Genie / AI-assisted querying.
  • Contribute to data governance using Unity Catalog (access control, PII protection) in compliance with GDPR and CCPA requirements.
  • Support onboarding of new sources (e.g. Magento, CRM) into the platform.
  • Pair with and upskill the client's internal engineers and analysts; document work for ongoing internal support.

Requirements

  • Strong data modeling expertise: dimensional modeling, grain definition, star and snowflake schema design, SCD handling, and surrogate key strategy — able to assess whether an existing model reflects a reliable single source of truth and design the right replacement for performance and scale.
  • Hands-on Databricks experience: PySpark and SQL, Delta Lake, medallion architecture, incremental processing, and Unity Catalog.
  • Solid ability to reverse-engineer Power BI, including DAX, semantic model relationships, and Power Query, where much of the transformation logic typically lives.
  • Strong SQL skills and experience with the Azure data ecosystem (ADLS Gen2, Azure Synapse, Azure SQL / SQL Server).
  • Comfortable working directly with non-technical business users — eliciting requirements, validating logic, and translating between business needs and technical implementation.
  • Experience implementing data governance and handling sensitive / PII data in compliance with GDPR and CCPA requirements.
  • Self-directed, ownership-oriented mindset suited to an embedded staff-augmentation role with a clear delivery mandate.

Nice to Have

  • Exposure to Dynamics 365 F&O data structures and Synapse Link — given the source architecture, this is a meaningful advantage.
  • Experience building semantic layers and enabling natural language querying (e.g. Databricks Genie).
  • Experience integrating ecommerce (Magento) and CRM data sources.
  • Retail, ecommerce, or consumer goods data experience.
  • Experience using LLM-based tools to document or reverse-engineer legacy DAX and SQL logic at scale.


Why You'll Like Working Here

Join a global team committed to Distillery's core values: Unyielding Commitment, Relentless Pursuit, Courageous Ambition, and Authentic Connection.

  • 100% Remote Work: Enjoy the freedom to work from anywhere while collaborating with a diverse, multinational team.
  • Competitive Compensation: Generous and competitive package in USD, along with a comprehensive benefits plan.
  • Flexible Hours: Create a schedule that aligns with your life and priorities.
  • Home Office Setup: Receive all the hardware and software needed to succeed from home.
  • Innovative Workplace: Collaborate with the top 1% of global talent in a multicultural and dynamic environment.
  • Focus on Growth: Pursue professional and personal development while contributing your unique talents to a team where you can truly shine!