Speech AI Engineer

Remote, USA
Posted Jun 13, 2026
Full-time

About VORTO

Vorto is on a mission to increase sustainability and create more jobs by making supply chains more efficient across the entire value chain. Through powerful AI technology, Vorto's autonomous supply chain platform seeks to reduce carbon emissions caused by supply chain transportation, improve the lives of approximately 3.5 million truck drivers and create more jobs across all players in B2B transactions. We operate in a very fast-paced and nimble environment that is highly focused on a team-first, accomplishment-oriented culture that is passionate about the organization's success.

Our products have been developed by a world-class engineering team that simplifies complex business problems to a degree where adoption is effortless. We encourage you to visit our careers page and read this blog post to learn more about our culture.

What You'll Do

Own the Speech interface layer: real-time speech-to-intent pipeline using models like Whisper or Deepgram

Build conversational UX flows for tasks like:

Load search and booking

Navigation and status updates

Voice journaling and micro-coaching

Optimize for real-world conditions: noisy cabs, unreliable signal, driver interruptions

Design fallback logic and graceful failure handling when voice commands are ambiguous or partial

Develop personalization features to learn and adapt to each driver’s preferences, tone, and goals over time

Personalization Responsibilities

You’ll lead the design of systems that help the assistant become a

relationship-level tool, not just a command engine:

Build a driver memory graph to store goals, language patterns, habits, and personality traits

Develop logic for driver-specific intent recognition and natural phrasing (e.g., “get me home this weekend”)

Implement adaptive prompting based on driver profile tags (e.g., preferred tone, motivational style)

Enable the system to reference past conversations, track personal milestones, and make coaching feel deeply contextual and human

Use retrieval-augmented generation (RAG) or custom embeddings to let the assistant "remember" and evolve

Technologies You Might Use

Speech: Whisper, Deepgram, Google Speech-to-Text

NLU / LLMs: GPT-4o, Rasa, LangChain, custom intent parsers

Frontend: React Native (for fallback UI), WebRTC, Twilio Voice

Infra: Node.js, Python, Firebase, Supabase, Postgres, Pinecone

You Might Be a Fit If You

Have built or contributed to voice or AI-driven products with personalization or memory features

Understand the difference between "generic commands" and relationship-based coaching

Know how to ship fast, but keep latency and UX top of mind

Are excited to build tools that respect and elevate working-class users

Are obsessed with human-first AI design

Benefits

At VORTO we are committed to developing our employees and providing them exciting opportunities to grow and prosper in their careers. We encourage you to visit our careers page and read this blog post to learn more about our culture.

We offer a competitive benefits package as well as numerous additional perks including:

Competitive compensation package

Health, Dental and Vision Insurance

401k with matching

Company paid life and short-term disability insurance

Company paid parking or transit pass

Relocation offered when applicable

Modern office space in downtown Denver

Daily coffee, tea, drinks & snacks

Team happy hours

VORTO is an Equal Opportunity Employer.

This position will be posted until a qualified candidate is hired.

Disclaimer: This job description is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities that are required of the employee. Other duties, responsibilities and activities may change or be assigned.

More Remote Jobs