Staff Research Engineer

Remote, USA
Posted Jun 14, 2026
Full-time

About Us:

Ambience is developing the most capable AI systems for healthcare and medicine. As healthcare costs soar to 17.3% of US GDP and a projected shortage of 100,000 physicians within the next decade, the need for AI is critical. Our frontline healthcare workers are overwhelmed, with only 27% of the average clinician's day spent on direct patient care.

Our vision is to advance healthcare by empowering clinicians with safe, intelligent AI agents that improve quality, reduce costs, and enhance both patient and provider experiences.

Headquartered in San Francisco, we have secured $100M in funding from top investors, including Kleiner Perkins, OpenAI Startup Fund, Andreessen Horowitz, Optum Ventures, Human Capital, and Martin Ventures. We collaborate with leading AI experts such as Jeff Dean, Richard Socher, Pieter Abbeel, and AIX Ventures.

Join us in the endeavor of accelerating the path to safe & useful clinical super intelligence by becoming part of our community of problem solvers, technologists, clinicians, and innovators.

The Role:

As a Staff Research Engineer at Ambience, you will push the boundaries of generative AI by translating cutting-edge research into working prototypes and experimental platforms. You’ll work closely with fellow researchers, engineers, and product leads to explore novel architectures, fine-tuning methods, evaluation paradigms, and data strategies—helping to define what’s possible with frontier AI models.

Our engineering roles are hybrid — working onsite at our San Francisco office three days per week.

What You’ll Do:

  • Prototype and Advance LLM Systems: Build and benchmark LLM-based systems and agents using open-source and proprietary models. Rapidly prototype new capabilities through fine-tuning, adapters, and reinforcement learning approaches.

     Drive Research-First Experimentation: Translate recent academic papers into reproducible experiments, focusing on fine-tuning (e.g., LoRA, QLoRA, DPO), model alignment, and hallucination mitigation techniques. Design clear experiment plans and share findings across the team.

    Build and Evolve Evaluation Pipelines: Define evaluation methodologies using human-in-the-loop feedback, synthetic benchmarks, and task-specific metrics. Implement continuous evaluation pipelines to track regressions and breakthroughs.

    Shape Data and Training Strategy: Curate datasets via synthetic generation, targeted scraping, and annotation pipelines. Establish practices for discovering failure cases and improving model robustness over time.

    Contribute to a Research-Driven Culture: Write research papers, internal memos, and blog posts. Foster a culture of experimentation, documentation, and knowledge-sharing across research and engineering teams.

    Who You Are:

    • Research Fluent

      Skilled at interpreting and replicating results from cutting-edge machine learning research.

      Experienced in designing experiments, running ablation studies, and ensuring reproducibility.

      4+ years of experience in machine learning research, experimental AI, or applied AI engineering.

      Demonstrated ability to replicate, extend, or publish original research.

      Deep Expertise in LLM Fine-Tuning

      Hands-on experience fine-tuning large language models and optimizing prompt and embedding strategies.

      Proficient with Python and deep learning frameworks such as PyTorch, JAX, and Hugging Face Transformers.

      Comfortable with distributed training environments and large-scale model experimentation.

      Evaluation and Data Obsessed

      Deep understanding of dataset curation, filtering, and alignment with evaluation goals.

      Familiar with human annotation pipelines, ranking models (e.g., RM, RLAIF), and interpretability techniques.

      Experienced in building evaluation frameworks tied to real-world task performance.

      Collaborative and Curious

      Thrives in research-driven environments with a commitment to experimentation, documentation, and cross-functional learning.

      Excited to prototype, present findings, and build at the frontier of AI advancement.

      Effective Interdisciplinary Collaborator

      Able to work alongside clinicians, product managers, and fellow engineers

      Strong communicator who can distill complex ML concepts for diverse audiences.

      Mission-Aligned

      Passion for healthcare or other mission-driven industries (e.g., education, climate tech)

      Thrives in a fast-paced, early-stage environment; takes extreme ownership of deliverables

      Nice-to-haves

      Open-source contributions to ML libraries, datasets, or benchmarks

      Experience working in AI research labs, frontier model companies, or early-stage AI startups

      Background in RLHF, alignment research, or AI safety

      Compensation

      $250,000 - $350,000, with the addition of significant equity.

      Are you outside of the range? We encourage you to still apply; we take an individualized approach to ensure that compensation accounts for all of the life factors that matter for each candidate.

      Being at Ambience: 

      An opportunity to work with cutting edge AI technology, on a product that dramatically improves the quality of life for healthcare providers and the quality of care they can provide to their patients

      Dedicated budget for personal development, including access to world class mentors, advisors, and an in-house executive coach

      Work alongside a world-class, diverse team that is deeply mission aligned

      Ownership over your success and the ability to significantly impact the growth of our company

      Competitive salary and equity compensation with benefits including health, dental, and vision coverage, quarterly retreats, unlimited PTO, and a 401(k) plan

More Remote Jobs