Back to Jobs

Machine Learning Enginer, Core Evaluations

Remote, USA Full-time Posted 2026-06-13

About Cantina: Cantina Labs is a social AI company, developing a suite of advanced real-time models that push the boundaries of expression, personality, and realism. We bring characters to life, transforming how people tell stories, connect, and create. We build and power ecosystems. Cantina, our flagship social AI platform, is just the beginning. If you're excited about the potential AI has to shape human creativity and social interactions, join us in building the future! About the Role: We are seeking an experienced Machine Learning Engineer (MLE) to focus on audio model evaluation, specifically for speech generation and recognition models. This role involves designing and developing comprehensive model evaluation pipelines for both development and production environments, as well as creating automated dashboards for reporting evaluation results. As the founding member of our evaluation team, the ideal candidate is expected to leverage their experience to lead our evaluation efforts and play a key role in the future growth of the evaluation team. What You’ll Do: Designing model evaluation pipelines for models in development and production Designing user studies for subjective model evaluations. Converting requirements into measurable metrics. Designing and developing automated evaluation dashboard to see model performances and compare results. Training new models to capture new and different evaluation metrics. Communicating with the model team to help design better models based on the evaluation results. Communicating with the data team to help decide the type of data necessary to improve model performance. Communication with the product-manager to make sure product requirements are correctly measured. Help grow the evaluation team as the founding member. Lead the evaluation team in the future. What You’ll Bring: Strong experience and intuition for designing metrics that capture model performance. Strong experience with designing user studies on Mechanical Turk or similar platforms. . Strong experience with model training and fine-tuning for model evaluation. Strong statistical knowledge and experience to statistically compare evaluation results and take decisions. Very strong engineering and programming skills. Experience with training ASR, TTS models. Experience at ML teams working on large-scale machine learning problems. (>3B models with >1m hours of data) Apply To This Job

Similar Jobs

Project Manager

Remote, USA Full-time

2nd shift - Call Center Representative (58728)

Remote, USA Full-time

Senior Software Engineer, Full-Stack (DIT)

Remote, USA Full-time

Director of Product, TMS Systems

Remote, USA Full-time

Senior Operations Program Manager

Remote, USA Full-time

Compliance Specialist - Remote/Travel - (LIHTC/Affordable Exp Required) - Bryten

Remote, USA Full-time

Part time (2nd shift) - Call Center Representative (58729)

Remote, USA Full-time

Marketing Analytics Manager

Remote, USA Full-time

Senior Channel Marketing Manager

Remote, USA Full-time

Senior HR Generalist

Remote, USA Full-time

Client Services Manager (Bilingual)

Remote, USA Full-time

Experienced Learning Experience Designer – WW Customer Trust Training, Risk LXD Team at arenaflex

Remote, USA Full-time

Experienced Customer Support Representative – Delivering Exceptional Experiences for arenaflex Customers (Remote, Part-Time)

Remote, USA Full-time

Experienced Live Chat Agent – Deliver Exceptional Customer Support Experience

Remote, USA Full-time

Immediate Hiring: Seasonal Customer Service Representative-Remote (Bilingual: Spanish and English)

Remote, USA Full-time

Remote Licensed Marriage and Family Therapist or Social Worker (LCSW, LMFT, LPCC)

Remote, USA Full-time

Global Study Lead

Remote, USA Full-time

Experienced Livechat Support Specialist – Delivering Exceptional Customer Service in a Remote Setting

Remote, USA Full-time

Director, Data Privacy Compliance

Remote, USA Full-time

Customer Service and Sales Representative – Talkeetna, AK

Remote, USA Full-time