Sonatype logo

Senior Data Scientist - Sonatype

View Company Profile
Job Title
Senior Data Scientist
Job Location
Hyderabad
Job Description
Sonatype is the software supply chain security company. We provide the world’s best end-to-end software supply chain security solution, combining the only proactive protection against malicious open source, the only enterprise grade SBOM management and the leading open source dependency management platform. This empowers enterprises to create and maintain secure, quality, and innovative software at scale.

As founders of Nexus Repository and stewards of Maven Central, the world’s largest repository of Java open-source software, we are software pioneers and our open source expertise is unmatched. We empower innovation with an unparalleled commitment to build faster, safer software and harness AI and data intelligence to mitigate risk, maximize efficiencies, and drive powerful software development.

More than 2,000 organizations, including 70% of the Fortune 100 and 15 million software developers, rely on Sonatype to optimize their software supply chains.


The Opportunity
We’re looking for a Senior Data Scientist to join our growing AI & Data Science team.
You’ll operate as an internal AI consultant and technical lead, helping multiple teams across Sonatype apply machine learning and generative AI to real-world problems.
You’ll explore complex datasets, design experiments, build models, and collaborate closely with product engineering, and security experts to turn research ideas into practical, scalable solutions.
This role is ideal for someone who thrives on autonomy, loves translating ambiguous ideas into working systems, and enjoys working across boundaries rather than in a single product lane.
What You’ll Do
  • Lead applied AI projects from concept to impact — prototype, validate, and help teams deploy practical ML and GenAI solutions.
  • Collaborate cross-functionally: Partner with product, engineering, and research teams to scope problems, identify opportunities, and co-develop solutions.
  • Act as an internal consultant: Advise teams on ML/AI best practices, model evaluation, and productive use of generative technologies.
  • Design robust experiments and establish evaluation pipelines for model reliability, accuracy, and business impact.
  • Bridge research and production: Package research insights into usable APIs, tools, or workflows for other teams.
  • Explore new techniques (e.g., LLMs, embeddings models, retrieval-augmented generation, agentic workflows) to enhance developer and security experiences.
  • Share knowledge and mentor peers, helping elevate the organization’s AI literacy and capabilities.
  • What We’re Looking For
  • 6+ years of experience in applied data science, machine learning, or AI research
  • Strong Python skills and hands-on experience with ML/AI libraries and platforms such as Databricks, OpenAI API, and Scikit-learn
  • Comfortable working with large, messy, or unstructured datasets — you know how to turn chaos into features, insights, and beautiful visualizations
  • Deep familiarity with LLMs and GenAI ecosystems (e.g. OpenAI, Claude, Hugging Face): skilled in prompt engineering, parameter tuning, and evaluating model behavior against ground truth
  • Experience taking ML or GenAI systems from prototype to production, even if small-scale or incremental
  • Strong analytical thinking, experimentation skills, and appreciation for trustworthy, data-driven evaluation
  • Proficiency with Git and collaborative code workflows (GitHub or similar)
  • A balanced mindset — equally comfortable exploring research ideas and implementing production-ready systems
  • Proactive and self-directed: you don’t wait for perfect specs; you find meaningful problems and drive them to completion
  • Bonus Points
  • Experience with AI-assisted coding tools (Copilot, Claude Code, Codex, etc.)
  • Familiarity with agentic workflows, Model Context Protocol (MCP), and tool-use integrations
  • Exposure to cybersecurity, anomaly detection, or code analysis
  • Understanding of MLOps practices (MLflow, AWS SageMaker, model serving, or monitoring)
  • Everything You Need, One Platform.

    From job listings to startups, investors to funding rounds, and everything in between, Employbl puts the power in your hands. Why wait?

    Start your free trial today!


    Stay Ahead of the Curve

    Sign up for our newsletter to stay informed about the latest startups and trends in the tech market. Let Employbl be your guide to success.

    Sonatype Headquarters Location

    Fulton, MD

    View on map

    Sonatype Company Size

    Between 200 - 1,000 employees

    Sonatype Founded Year

    2008

    Sonatype Total Amount Raised

    $154,707,328

    Sonatype Funding Rounds

    View funding details
    • Private Equity

      $80,000,000 USD

    • Private Equity

      $30,000,000 USD

    • Series C

      $25,000,000 USD

    • Series B

      $9,999,999 USD

    • Series A

      $5,207,329 USD

    • Series A

      $4,500,000 USD