Technical Product Manager

Remote, USA
Posted Jun 12, 2026
Full-time

At Robots & Pencils, we build meaningful, scalable digital products that solve real business problems. We are looking for a Staff Product Manager who combines deep Generative and Agentic AI fluency with hands-on building ability to own AI product outcomes end-to-end. As a Staff PM, you're accountable for initiative-level outcomes, stakeholder satisfaction, and contributing to R&P's AI product practice. You think in systems, work backwards from the customer problem, and stay relentlessly curious about what's next in AI

 

Enterprise clients want to deploy Agents -  moving from a promising demo to a production system that works at scale, meets security and compliance requirements, and delivers measurable business value is hard. This role owns that problem. You'll be part of a GenAI initiative within the AWS ecosystem, building the evals, tools, patterns, and reference architectures that make AI deployment repeatable.  The mindset: prove it works, test assumptions early, and document while building.

Key Responsibilities

Product Strategy & AI Vision

Define and drive the product vision, strategy, and roadmap for GenAI solutions - with agentic AI (agent orchestration, tool use, multi-step workflows) as the primary focus - connecting AI capabilities to enterprise business outcomes

Translate enterprise problems into structured product requirements; reframe feature requests into outcome-driven priorities with explicit tradeoffs on invest in vs. defer

Balance near-term deployment milestones with long-term platform scalability and sustainability

Monitor the competitive GenAI landscape and emerging agentic patterns to inform roadmap and technology decisions

Discovery & Validation

Research how enterprise users interact with AI agents and where they lose trust; frame the riskiest assumptions as testable hypotheses and de-risk them first

Design and run experiments - POCs, pilot deployments, scenario-based testing of multi-step workflows, edge cases, and failure recovery - to validate agentic solutions where non-deterministic output makes traditional QA insufficient

Distill research, experiments, and competitive intelligence into clear insights that pave the path for a successful product

Agent Design, Prototyping & Production

Define agent behavior and prototype system prompts and tool schemas; partner with engineering on context management - summarization, working memory, and information flow across multi-step tasks

Drive multi-model architecture tradeoffs with engineering - define the quality, cost, and latency targets that determine which model serves each step in the agent workflow

Build AI prototypes to validate hypotheses; define human-in-the-loop boundaries and guardrails - when the agent acts autonomously, when it escalates, and how to handle non-deterministic output

Establish agent evaluation frameworks - task completion, reasoning quality, tool selection, failure recovery, safety - and partner with engineering on production readiness (observability, drift, responsible AI, prompt versioning)

Define success metrics at the agent level - task completion rate, cost per task (not per inference), escalation rate, time to resolution, and customer trust alongside business KPIs

Delivery & Execution

Own the end-to-end product lifecycle from discovery through phased rollouts; establish the metrics framework (north star, input, guardrail metrics) and report product impact to leadership

Manage the product backlog, scope, dependencies, and risks; drive agile ceremonies and produce high-quality PRDs, product briefs, and decision logs

Evaluate technology and platform decisions from a product perspective; create deployment playbooks, reference architectures, and knowledge transfer materials so teams sustain solutions independently

Use AI to accelerate product work - research, analysis, prototyping, documentation - with judgment on when it needs human oversight; onboard rapidly to new domains and support team members across the initiative

 

Stakeholder Management

Build trusted relationships with stakeholders and executives; serve as the go-to product advisor and primary contact for AI product direction and deployment strategy

Partner with AWS Solution Architects and account teams to align on technical approach, service selection, and go-to-market for GenAI solutions

Manage expectations on scope, timelines, and tradeoffs; facilitate decisions across competing priorities using data, alternatives, and clear rationale

Frame AI capabilities and limitations for non-technical stakeholders - manage hype cycles, set realistic expectations; surface unmet needs that deepen relationships and grow the account

Required Skills

8-12+ years in product management, forward deployment, or solutions engineering; must have shipped AI products from prototype through production at scale

Strong product sense - ability to identify what matters to users and the business, make prioritization calls with incomplete information, and shape products that deliver real outcomes

Deep GenAI fluency - LLMs, RAG, fine-tuning, prompt engineering, context engineering, evals - with hands-on experience building or shipping agentic systems (planning, tool use, HITL, guardrails)

Proven ability to prototype AI solutions using AI tools (Cursor, Claude, Copilot) to validate hypotheses and de-risk product decisions

Experience deploying AI solutions in enterprise environments with strong technical fluency - can read code, evaluate architectures, make product tradeoffs on technical constraints, and drive scalable deployment patterns

Exceptional communicator - clear PRDs, technical specs, and decision logs; has led AI products through full lifecycle and driven alignment with Directors, VPs, and C-level

Comfortable operating in ambiguous, fast-moving environments where the AI landscape evolves weekly

PM-level fluency across the AWS AI ecosystem - Bedrock, AgentCore, SageMaker, Strands, Kendra, OpenSearch, Lambda, Step Functions - to make informed product and architecture decisions

Preferred Qualifications

Software engineering or coding background (Python, JavaScript, TypeScript)

Agency or consulting delivery experience

Experience in Financial Services, Healthcare, or Life Sciences industries

Familiarity with open-source LLM ecosystem (Llama, Mistral) for flexibility and cost optimization

Prior experience leading time-boxed discovery initiatives or technical spikes with rapid validation cycles

Why Join R&P?

You'll work at the intersection of cutting-edge AI and real enterprise impact - helping clients deploy Generative and Agentic AI solutions that change how their businesses operate. R&P gives you the variety of consulting (new problems, new industries, new tech) with the depth of a product role - you'll build, ship, and measure, not just advise. The team is collaborative, technically sharp, and genuinely invested in doing great work for clients.

More Remote Jobs