Nuvepro - Task Intelligence for the Enterprise
xAI· Human Data· Remote

AI Tutor - Japanese

Classified Tasks (11)

Automate 0%Augment 91%Human-Only 9%

Augment (10)

AI assists, human decides

Train and refine Grok to improve voice interactions, speech recognition, and auditory experiences across diverse languages, accents, and cultural contexts

technical

Curate high-quality multilingual audio datasets for model training and evaluation

operational

Annotate multilingual audio clips, voice recordings, speech samples, and auditory elements with accurate labels and metadata

operational

Use proprietary software to provide labels, annotations, recordings, and inputs on multilingual audio projects

operational

Support delivery of curated audio data that yields clear, natural spoken output and meets professional audio standards

operational

Ensure annotated data accurately represents linguistic and prosodic details such as intonation, rhythm, and accent

analytical

Collaborate with technical staff to design tasks that improve the model’s handling of speech modulation, accent variation, and noise in real-world recordings

communication

Work with technical staff to improve annotation tools and workflows for efficient audio data processing

technical

Enable natural spoken interactions for users worldwide by improving multilingual speech processing pipelines

technical

Bridge language barriers through accurate speech processing and by improving the AI’s handling of multilingual audio nuances

communication

Human-Only (1)

Requires human judgment

Provide high-quality voice recordings and audio inputs for use in model development and testing

creative

Job description

ABOUT xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates. ABOUT THE ROLE: As an AI Tutor specialized in multilingual audio capabilities, you will contribute to xAI's mission by training and refining Grok to excel in voice interactions, speech recognition, and auditory experiences across diverse languages, accents, and cultural contexts. Your work will focus on curating and annotating high-quality audio data to enhance Grok's global accessibility, enabling natural spoken interactions for users worldwide, bridging language barriers through accurate speech processing, and improving the AI's handling of multilingual audio nuances. RESPONSIBILITIES: Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages. Support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details (such as intonation, rhythm, and accent), and professional audio standards. Collaborate with technical staff to develop tasks that improve AI's ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing. Work with technical staff to improve annotation tools for efficient audio workflows. BASIC QUALIFICATIONS: Native proficiency in Japanese with exposure to diverse accents, dialects, or regional variations. Proficiency in English (minimum B2 level) with clear, natural vocal delivery and pronunciation suitable for audio recording purposes. Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages. Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form. Demonstrated ability to transcribe audio with high accuracy across accents and varying audio quality. Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages. Strong comprehension skills and the ability to make independent judgments on ambiguous or varied audio material, including noisy or accented speech. Strong communication, interpersonal, analytical, detail-oriented, and organizational skills, with the ability to articulate audio-related feedback effectively. Commitment to developing AI that masters sophisticated multilingual audio capabilities. PREFERRED SKILLS AND EXPERIENCE: Demonstration of exceptional attention to linguistic nua
Source: xAI careers · scraped 2026-05-22
Apply at xAI