About TrustLab
Online misinformation, hate speech, child endangerment, and extreme violence are some of the world's most critical and complex problems. TrustLab is a fast-growing, VC-backed startup, founded by ex-Google, TikTok and Reddit executives determined to use software engineering, ML, and data science to tackle these challenges and make the internet healthier and safer for everyone. If you’re interested in working with the world’s largest social media companies and online platforms, and building technologies to mitigate these issues, you’ve come to the right place.
About the Role
As an AI Safety analyst, you will be engaging on the full spectrum of policy issues on AI Safety and play an integral role in building deep expertise within the team. You will work directly on solving real world complex trust & safety and fraud issues. Your work will be critical in the design & development of our AI safety product & service offerings.
Day-to-day work may encompass anything from risk helping to shape strategic initiatives, technical/policy research, risk evaluations and investigations. You will also get to work on adversarial and red-teaming opportunities to protect real users and improve AI security.
This role can be performed remotely from anywhere in Singapore or Palo Alto.
Responsibilities
Develop deep subject matter expertise in role of AI safety in cyber security risks Discover and exploit Responsible AI vulnerabilities end-to-end in order to assess the safety of systems by developing responsible AI red teaming methodologiesDevelop a framework for testing and benchmarking the safety of AI ModelsPlay a role in building & improving Gen AI fraud & risk detection capabilitiesMonitor the policy landscape to identify relevant questions and emerging policy areas to build our expertise in the subjectKeep up to date with new and existing AI policy norms and standards, particularly those related to cyber security, and use these to inform our decision-making on policy areas
Minimum qualifications
Bachelor's degree or equivalent practical experience3+ years track record in trust & safety, risk evaluations, fraud investigations, technical/data analysis Experience and familiarity with AI or a demonstrated interest in AI policy issuesExperience in data analysis or data science - identifying trends and drawing actionable insightsHave a deep practical familiarity with understanding of how AI technology contributes to online risks & threatsWorked on topics around: AI risk assessment, model safety, promptingStay up-to-date and informed by taking an active interest in emerging research and industryPassion for using AI to create safe and beneficial products
Preferred skills
Experience and familiarity with AI or a demonstrated interest in AI policy issues and researchStrong familiarity with existing GenAI / LLM / ML standards - prior experience exploring, testing and evaluation of language model behavior.Experience in benchmarking Generative AI issues and quantify improvementsExperience with SQL and a programming language (e.g., Python or R)
Opportunities and perks
Competitive compensation at a rapidly growing Series A, VC-backed startup Remote-first, with the ability to work from home or co-locate with our Singapore or Palo Alto teamsInfluence new product direction from idea to commercializationHelp develop critical tech to solve one of the 21st century’s trickiest societal problems