Computer Science Subject Matter Expert - AI Evaluation (US-Remote) Job at Braintrust, Austin, TX

U3l5emZObzZRbUQ1dnJyc29yNzN4WVBOZlE9PQ==
  • Braintrust
  • Austin, TX

Job Description

Computer Science Subject Matter Expert - AI Evaluation (US-Remote) Join to apply for the Computer Science Subject Matter Expert - AI Evaluation (US-Remote) role at Braintrust Base pay range $75.00/hr - $90.00/hr Job Description: Seeking multiple Computer Science Subject Matter Experts to help design, run, and optimize data collection and evaluation workflows for GenAI research. You’ll translate high-level research needs into scalable processes, produce and curate challenging domain problems, and ensure factual, bias‑aware, high‑quality datasets for LLM training. To Note: This is for an immediate project need. Project is approved for 3‑months initially, with possibility to extend based on project/client demands. To Note: Hourly rate range (75–90) is in USD per hour. Responsibilities: Partner with GenAI researchers/engineers to capture data needs and success criteria. Expand high-level requirements into clear, executable workflows for larger teams. Execute collection/evaluation workflows rapidly with minimal supervision. Innovate on workflows to maximize throughput and quality. Collaborate cross‑functionally to maintain quality at scale. Conduct in‑depth LLM‑assisted research; gather reliable, up‑to‑date info. Craft original, high‑quality content and hard problems for LLM eval/train. Perform rigorous fact‑checking (precision/recall) to prevent misinformation. Requirements: Education: Master’s with distinction or PhD in Computer Science; top‑tier institution preferred. Significant domain experience considered. Detail orientation; precise data presentation; thorough proofreading. Communication: articulate complex info; strong collaboration. Understanding of AI/LLMs, their capabilities/limits. Prompt engineering and familiarity with AI writing tools. Ethical AI awareness and data literacy (collection, cleaning, transformation). Thrives in fast‑paced, minimally supervised environments. Seniority level Entry level Employment type Full‑time Job function Engineering, Information Technology, and Science Industries Technology Information and Internet Computer and Network Security #J-18808-Ljbffr Braintrust

Job Tags

Remote job, Hourly pay, Full time, Immediate start,

Similar Jobs

ati

Finance (Accounting) Intern- 2026 Job at ati

 ...perform -- and so is our team. We're hiring high performers as proven as our products. Join us. We are currently seeking a Finance intern for our Natrona Heights, PA operations in our Accounting Department. Essential Functions Streamlining and simplification... 

Mongoose Trucking

CDL A Oil Field Drivers Job at Mongoose Trucking

 ...REQUIREMENTS ~ CLASS A CDL ~1 Year Driving Experience - Required ~6 Months Onsite Oil Field Experience - Required ~3-6 Months Winter Driving Experience - Preferred ~ Tanker Endorsement - Required~21 Years or Older ~ Need to be able to drive 13 and 18... 

Sitter.com

Sitter Wanted - Babysitter Wanted In Roswell Job at Sitter.com

Posting:Hey my name is I'm seeking a baby-sitter available in Roswell, Georgia. My aim is to find a dedicated person who has date night availability.Duties:My home needs date night supervising, overnight care, and pet care. Ideally, you am comfortable working with twins...

United Parcel Service Inc.

Seasonal Package Handler - Available Job at United Parcel Service Inc.

 ...Join the UPS team as a Seasonal Package Handler and kickstart your career today! In this role, you will play a crucial part in the fast-paced environment...  ...to work hard! Your Responsibilities: Handle packages weighing up to 70 lbs Maintain attention to detail... 

Skaggs Community Hospital Association

MRI-Xray Technologist WEO Cert Job at Skaggs Community Hospital Association

 ...Description :Operate Magnetic Resonance Imaging (MRI) scanner(s) within all guidelines of MRI safety. Demonstrates a thorough understanding of MRI safety zones and practices all safety guidelines. Interview/screen patient for appropriateness of exam and patient history...