Senior Research Scientist, Model Evaluation - Cardina

Job Title
Senior Research Scientist, Model Evaluation
Job Location
New York
Job Description
Who are we?
Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI.

We obsess over what we build. Each of us is responsible for increasing the capabilities of our models and the value they drive for our customers. We like to work hard and move fast to do what’s best for our customers.

Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft. Each person is one of the best in the world at what they do. We believe that a diverse range of perspectives is a requirement for building great products.

Join us on our mission and shape the future!

Why this role?
Evaluation is critical to making progress in scaling intelligence. As models continue to become superhuman in many real-world use cases, we must continue to develop new techniques to accurately measure our models' performance on frontier capabilities. In this role, you will be responsible for building scalable infrastructure and next-generation evaluation methods to measure LLM progress.
As a Senior Research Scientist, Model Evaluation, you will:
  • Define, implement, and experiment with radically new approaches to measuring model quality
  • Develop pristine evaluation datasets and state-of-the-art human evaluation methodology in partnership with our world-class, in-house human annotation program
  • Build robust and scalable evaluation tooling used by all members of technical staff at Cohere, including our leadership and our CEO
  • Learn from and work with the best researchers and engineers in the field
You may be a good fit if:
  • You have direct experience developing foundation models
  • You have a track record of developing new methods or data to evaluate foundation models
  • You have worked directly with data annotators to create cutting-edge text datasets
  • You have published at top-tier conferences and venues (ICLR, ACL, NeurIPS)
  • You are obsessive about rigorously and accurately measuring AI capabilities
  • You have strong software engineering skills

Cardina Headquarters Location
New York, NY

Cardina Company Size
Between 10 and 50 employees

Cardina Funding Rounds
  • Seed: $3,100,000 USD
  • Seed: $150,000 USD