Zoox logo

Audio Signal Processing and ML Engineer - Zoox

View Company Profile
Job Title
Audio Signal Processing and ML Engineer
Job Location
Foster City, CA
Job Description
At Zoox, you’ll collaborate with a team of world-class engineers with diverse backgrounds in areas such as AI, robotics, machine learning, localization, computer vision, and simulation. You’ll master new technologies while working on code, algorithms, and research in your area of expertise to create and refine key systems and move Zoox forward.

The Audio Team at Zoox is seeking an experienced audio perception engineer. In this role, you will develop and implement novel machine learning architectures and algorithms for understanding and processing audio events in complex urban settings. You will have access to the sensor data from state-of-art microphone arrays and an incredible infrastructure for algorithm testing and validation. You will develop critical audio perception systems that enhance our autonomous vehicles' environmental awareness and safety, while working with cutting-edge technology to shape the future of transportation
In this role, you will:
  • Design and implement real-time perception algorithms for audio event detection and environmental sound understanding
  • Develop and optimize audio processing software pipelines for autonomous vehicle applications
  • Collaborate with cross-functional teams to integrate audio perception with other sensor modalities
  • Build and maintain production-quality code for deployment on GPU/CUDA systems
  • Develop multimodal sensor fusion algorithms for environmental event detection including foundational models
  • Qualifications
  • Master's degree or PhD in Computer Science or related field
  • Experience with signal processing algorithms related to audio and speech processing
  • Experience with machine learning techniques such as deep neural nets (DNN, CNN, LSTM-RNN) and traditional statistical modeling/feature extraction techniques (GMM, HMM, NMF / spectrograms, MFCC etc) for voice recognition and audio event classification.
  • Hands-on and theoretical background of signal processing techniques including adaptive filtering, filter banks and wavelet processing, speech analysis and synthesis, speech and audio coding
  • 5+ years of industry experience in machine learning
  • Strong programming skills in C++ and Python
  • Expertise in deep learning frameworks (PyTorch, TensorFlow, or similar)
  • Demonstrated ability to handle large datasets efficiently
  • Strong mathematical foundation in probabilistic technique
  • Bonus Qualifications
  • Experience with sensor fusion
  • Background in autonomous systems or robotics
  • Publications in conferences related to computational auditory scene analysis (D-CASE, ICASSP, or WASPAA)
  • Experience with contact sensor or accelerometer data processing
  • Knowledge of microphone array processing and beamforming
  • Everything You Need, One Platform.

    From job listings to startups, investors to funding rounds, and everything in between, Employbl puts the power in your hands. Why wait?

    Start your free trial today!


    Stay Ahead of the Curve

    Sign up for our newsletter to stay informed about the latest startups and trends in the tech market. Let Employbl be your guide to success.

    Zoox Headquarters Location

    Foster City, CA

    View on map

    Zoox Company Size

    Between 2,000 - 5,000 employees

    Zoox Founded Year

    2014

    Zoox Total Amount Raised

    $1,005,000,000

    Zoox Funding Rounds

    View funding details
    • Convertible Note

      $200,000,000 USD

    • Series B

      $465,000,000 USD

    • Series A

      $50,000,000 USD

    • Series A

      $250,000,000 USD

    • Seed

      $40,000,000 USD