RE/RS, Data Understanding - Foundations
Posted bythe hiring team· 2 days ago
- Location
Posted bythe hiring team· 2 days ago
RE/RS, Data Understanding - Foundations
USD 445,000 – USD 555,000
Top 2% in Data
Be among the first applicants
Verified team
HR-vetted before going live.
Transparent pay
Salary stated upfront.
Be among the first applicants
Just opened — your application stands out.
About this role
About The Team
The Data Understanding team is responsible for creating the high quality datasets and their quantized representation for the company. This includes synthesizing data, building VQ representations, and processing, filtering, deduplication, quality control, and tokenization so it can be used effectively in big model training runs.
About The Role
We're looking to advance how the company builds and understands pretraining data at scale. You'll treat data quality and curation as core research problems: developing new methods to select, combine, and transform data; creating datasets that improve model capabilities; and designing rigorous experiments to understand how data choices and interventions affect model learning and downstream behavior. You'll work closely with frontier models and web-scale data to build evidence for which approaches work and why, then translate successful research into scalable data processing pipelines
We Expect You To
Have a strong track record of new or improved ML ideas, through publications, projects, or applied research.
Own and drive a research agenda, from choosing the right problems to carrying long-running work through to impact.
Be excited by the company empirical, collaborative approach to research.
Nice To Have
Thoughtfulness about AI’s impact, including privacy, provenance, and data quality.
Experience building high-performance deep learning or large-scale data processing systems.
About the company
the company is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.
We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.
For additional information, please see this form. No response will be provided to inquiries unrelated to job posting compliance.
We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.
The RE/RS, Data Understanding - Foundations role with the hiring team offers USD 445,000–555,000 per year. Salary information is published as part of every JobRemotely listing so candidates can self-screen before applying.
Yes — the hiring team has marked this RE/RS, Data Understanding - Foundations role as open to candidates based in United States. Eligibility requirements are surfaced in the JobPosting structured data on the listing.
The hiring team uses the JobRemotely structured hiring pipeline: candidates apply through the listing, complete a paid test task or screening, and only then proceed to interviews. This skips the resume black hole and respects everyone's time.
Similar roles
Hand-picked from the same category.
the hiring team· San Francisco·Remote·2 days ago
USD 228,960 – USD 315,360
Viewthe hiring team· San Francisco·Remote·2 days ago
USD 170,400 – USD 223,200
Viewthe hiring team· San Francisco·Remote·2 days ago
USD 297,000 – USD 330,000
Viewthe hiring team· San Francisco·Remote·2 days ago
USD 260,000 – USD 288,000
View