About the position
We’re looking for a Senior Data Scientist who’s excited about building robust, high-impact ML systems with a strong focus on textual data, NLP, and modern LLM-based architectures. This role is perfect for someone who thrives in startup environments, loves shipping hands-on solutions, and brings sharp product thinking to every line of code or experiment they run. You’ll work closely with product, engineering, and data teams to build and scale ML-powered systems that drive meaningful outcomes for users and the business.
Responsibilities
- Work closely with product managers to understand customer needs, prioritize effectively, and identify the minimum valuable ML that creates the most impact
- Operate with a product mindset: make smart tradeoffs between speed and quality, scope and value — and know when to ship fast and when to go deep
- Drive high standards and share knowledge across the team and company through thoughtful collaboration, peer reviews, and mentorship
- Design, implement, and own end-to-end machine learning features and pipelines — from ideation and prototyping to productionization and monitoring
- Build and scale NLP/LLM-driven systems that turn raw product text into structured representations (e.g., classifications, embeddings, summaries, agents)
- Think and operate in production: handle scale, monitoring, failure modes, and iteration — not just research
Requirements
- 6+ years of hands-on experience as a data scientist, including at least 2 in early-stage startups or fast-paced environments.
- 3+ years leading full-cycle ML projects — from research to production, adoption, and monitoring.
- Deep familiarity with building ML products using textual inputs.
- Experience building LLM-based systems.
- Leverage AI-powered tools like Cursor, Claude Code, GitHub Copilot, or ChatGPT as part of your workflow to accelerate development, debug faster, and explore ideas more effectively.
- You bring strong product and user intuition — you’re not just building models, you’re solving real problems.
- You’re a great teammate: collaborative, communicative, and fun to work with.
- Master’s degree in Computer Science, Engineering, Statistics, or a related technical field.
Advantages
- Experience developing or orchestrating agents (e.g., multi-step reasoning workflows, tool-using systems, etc.).
- Experience with MLOps tools (e.g., MLflow, Airflow, Weights & Biases).
About Harmonya
Retailers and CPG brands rely on product data to make critical decisions, but outdated systems limit what’s possible. Harmonya changes that.
Our AI-powered solutions transform fragmented, inconsistent product data into a dynamic, structured, and enriched source of truth. By analyzing trillions of alternative data points, we help leading CPGs and retailers—including Coca-Cola, Nestle, PepsiCo, and more—gain deeper insights, improve product discovery, and make smarter, faster decisions. Founded in 2021, Harmonya is backed by investors including Bright Pixel, Team8, Susa Ventures, J Ventures, and others.