Senior Software Engineer (Platform & Infrastructure)
About Simbian
Simbian® is building an Agentic AI platform for cybersecurity. Founded by repeat successful security founders, we have gathered an excellent cohort of employees, partners, and customers.
Our mission is to solve security using AI and our core values are excellence, replication, and intellectual honesty. Our promise is to make Simbian the best workplace of your career, and we believe a small group of thoughtful, passionate people can make all the positive differences in the world.
Our AI Agents automate security operations and provide our customers with 10x leverage. Our customers include some of the world’s largest companies.
Our initial use cases include:
AI-based SOC alert triage and investigation
AI-based Threat Hunting
AI-based Penetration Testing
Role — Senior Software Engineer (Platform & Infrastructure)
You will design and build the foundational infrastructure powering our agentic security platform — secure multi-tenant systems, distributed services, scalable execution layers, and the internal platform capabilities that allow our AI agents to operate reliably in production environments.
You’ll work closely with founders, product, AI, and security engineering teams to build systems that are secure, observable, resilient, and scalable from day one.
This role is ideal for engineers who think deeply about distributed systems, platform architecture, reliability, infrastructure, and backend scalability.
In your first few months, you will:
Own end-to-end design and delivery of platform capabilities
Build and scale distributed backend services powering AI agents and customer workflows
Improve authentication, authorization, and tenant isolation across the platform
Design infrastructure and observability systems for reliability at scale
Deploy and operate production-grade systems with strong ownership beyond launch
Collaborate cross-functionally with engineering, product, customer-facing teams, and leadership to shape platform architecture and engineering standards
What You’ll Work On
Platform & Distributed Systems
Build high-scale, multi-tenant backend systems that securely support multiple customer environments
Design and operate distributed microservices with strong reliability and fault tolerance
Build asynchronous execution pipelines, event-driven architectures, and workflow orchestration systems
Improve scalability, throughput, concurrency, and system performance across services
Design resilient APIs, service communication patterns, and infrastructure abstractions
Infrastructure & Reliability Engineering
Build and maintain cloud-native infrastructure across AWS/Azure
Improve deployment systems, CI/CD pipelines, container orchestration, and runtime reliability
Design platform capabilities around monitoring, observability, tracing, logging, and alerting
Build systems for operational visibility, debugging, incident response, and production diagnostics
Drive engineering best practices around resiliency, uptime, scalability, and operational excellence
Security & Access Systems
Design and implement authentication and authorization systems across services
Build secure multi-tenant access control systems, RBAC models, and policy enforcement layers
Improve platform security posture through secure infrastructure and backend design principles
Work closely with security engineering teams on platform hardening and safe execution models
AI Infrastructure Enablement
Build backend infrastructure powering AI agent execution and orchestration
Design memory retrieval, ingestion, indexing, and context-access systems for agents
Support scalable evaluation pipelines, traceability, and execution monitoring for AI workflows
Collaborate with AI engineering teams to improve platform performance and reliability for agent-based systems
You’ll Thrive Here If You
Think deeply in systems and architecture
Enjoy solving ambiguous engineering problems from first principles
Care about reliability, maintainability, scalability, and elegant abstractions
Like owning systems end-to-end, including production operations
Enjoy building infrastructure that other engineers depend on
Understand the tradeoffs involved in distributed systems and platform design
Enjoy reading RFCs, architecture docs, and learning new technologies
Requirements
Must Have
6+ years of strong backend or platform engineering experience
Strong understanding of distributed systems fundamentals
Experience designing and operating microservices at scale
Experience with authentication and authorization systems (OAuth, RBAC, JWT, SSO, IAM, etc.)
Strong knowledge of cloud infrastructure (AWS/GCP/Azure), containers, and CI/CD systems
Experience with observability tooling, monitoring systems, logging, tracing, and production debugging
Familiarity with asynchronous systems, queues, event streams, and workflow orchestration
Strong coding experience in Python, Go, Java, or Node.js
Ownership mindset — ability to take systems from concept → production → scale
Bonus / Great to Have
Experience working on platform engineering or internal developer platforms
Experience with Kubernetes, Kafka, OpenTelemetry, Prometheus, Grafana, or similar tooling
Experience building multi-tenant SaaS platforms