T
Talent@ Beta
Xai

Member of Technical Staff - Model Evaluation

Xai · Series B · Website

Role Details

Location
Palo Alto, CA
Salary
$180,000 - $440,000
Department
Model
Type
Full-time
Vertical
AI
Posted
1 week ago

Job Description

About xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

RESPONSIBILITIES:

  • Provide complete assessment of models.
  • Deep dive into model training and data to identify the weakness point revealed in evaluation.
  • Communicate with modeling and data team to come up with plans to improve model quality.

BASIC QUALIFICATIONS:

  • Model assessment and evaluation task development  (including public and in-house benchmarking).
  • Collect data and synthesize data for new evals.
  • Build infrastructure and framework for easy-to-use model evaluation, familiarity with inference frameworks like SGlang and vLLM.

COMPENSATION AND BENEFITS: 

$180,000 - $440,000 USD

Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.

xAI is an equal opportunity employer. For details on data processing, view our Recruitment Privacy Notice.

About Xai

Elon Musk's AI company building Grok. Focus on understanding the universe.

View company profile

Similar roles at other companies

Member of Technical Staff, Model Efficiency
Cohere · Series D+ · New York
Technical Program Manager – Adversarial Model Research
Openai · Series D+ · San Francisco
Technical Lead Manager, Visualization
Foxglove · Series B · San Francisco, CA
Robotics Engineer - Vision Language Action Model
Sensmore · Seed · Berlin / Potsdam
Technical Program Manager, Frontier AI Research
Deepmind · Acquired · Mountain View, California, US
Technical Project Manager QVAC (100% remote Worldwide)
Tether · Late Stage · Remote job

You'll be redirected to the company's application page

Get roles like this daily

Join our Telegram channels for curated job alerts