Principal Data Scientist (Databricks, GenAI, LLMs)

7000 euro gross

About the Role

We’re looking for a Principal Data Scientist who thrives at the intersection of machine learning, modern data platforms, and real business impact. This is not a traditional researcher role — you’ll act as a solution architect, working hands-on with Databricks and GenAI technologies to design and deliver real-world ML solutions.

In this role, you'll collaborate with both enterprise clients and the Databricks team, navigating ambiguous requirements, identifying opportunities for GenAI and ML, and building robust pipelines that go to production.

Responsibilities

In this role you will:

Lead full-cycle ML solution development: from problem discovery to production deployment.
Build and optimize ML pipelines using Databricks (Notebooks, Delta Lake, Unity Catalog, MLflow, Serving Endpoints).
Apply GenAI techniques: prompt engineering, fine-tuning, agent design, RAG, chatbots.
Translate business problems into data science solutions that deliver measurable value.
Work closely with stakeholders and clients to understand real-world constraints and guide delivery.
Collaborate with Data Engineers, DevOps, and PMs in cross-functional squads.

Why This Role Is Unique

Architect-level ownership: You'll be responsible for designing scalable ML pipelines from scratch, choosing the right tools and frameworks for the job.
GenAI in production: Go beyond experimentation — apply LLMs, agents, and RAG pipelines to solve real client problems.
Databricks at scale: Work with modern components like MLflow, Unity Catalog, Delta Lake, and Serving Endpoints — often in direct collaboration with the Databricks team.
Impact-first consulting: Every project is different. You’ll work across telecom, fintech, and retail, shaping analytics strategy from the ground up.
Shape the DS practice: Be one of the first hires in a growing Data Science team. Influence standards, tools, and the culture of delivery.

Requirements

Must-Have:

4+ years of end-to-end Data Science experience with a focus on production-grade ML.
Strong Python skills and practical experience with scikit-learn, pandas, numpy.
Hands-on experience with Databricks, including MLflow, Delta Lake, Unity Catalog.
Experience building and deploying GenAI use cases (prompting, RAG, chatbots, etc.).
Proficiency in SQL and working with analytics layers.
Excellent communication and stakeholder management skills.
Comfort working in fast-paced, ambiguous environments with multiple stakeholders.

Nice-to-Have:

3+ years of consulting or client-facing experience.
Certifications in Databricks (ML Associate, ML Professional) or relevant cloud platforms (AWS, Azure, GCP).
Familiarity with Docker, orchestration tools, and data pipelines.
Experience working with Spark or large-scale data processing tools.
Knowledge of MLOps practices and CI/CD for ML workflows.

Location

EU, LatAm
(Full remote, b2b contract)

Technologies We Use

Core: Python, SQL, scikit-learn, Spark, PySpark, Delta Lake
Cloud & Platform: Databricks, Unity Catalog, Azure / AWS
MLOps: MLflow, GitHub, JIRA
GenAI: OpenAI, HuggingFace, LangChain, RAG pipelines, LLM APIs

Conditions

Fully remote setup and flexible working hours
T1A Christmas Holidays as an additional paid vacation time coordinated for the entire company.
Public Holidays according to the legislation of the country of your residence.
Annual performance review, with the possibility of salary and/or bonus revision.
Close partnership with Databricks and high-profile clients
Work on production systems, not research experiments
Opportunity to help shape internal DS standards and direction
International, experienced team with minimal bureaucracy

Share this job opening

Application:

First name

Last name

Phone number

Cover letter

Link to CV (If You Have One)

Upload CV

By applying to this job opening you confirm your consent to processing your personal data and accept Dina Veprikova Privacy Policy