Staff Engineer, API Core Platform
Together AI
Staff Engineer — Product Engineering Core Platform
About the role
Together AI is seeking an experienced Staff Engineer to own and evolve the core product runtime that powers our Cloud application surfaces. This role sits at the intersection of infrastructure and application architecture and is responsible for ensuring our systems are stable, scalable, and cleanly structured as we grow.
You will lead the technical evolution of our production runtime and application architecture, strengthening the foundation that product engineers build on every day. In the near term, this includes improving the performance, reliability, and structure of our existing Next.js-based application and helping drive the separation of responsibilities across the stack. Over time, you will shape the long-term architecture of the product platform so teams can ship quickly and confidently at production scale.
This is a deeply hands-on Staff role for an engineer who has operated real production systems, improved them under pressure, and led structural change. You will be the technical owner of how the product runs in production — ensuring it remains fast, resilient, and easy to evolve as the company scales.
You will help shape the foundation that a rapidly growing product engineering organization builds on — transforming today’s systems into a platform that supports speed, reliability, and scale for years to come.
Responsibilities
- Own the evolution of the product runtime and application architecture, ensuring systems are resilient, scalable, and production-ready
- Drive structural improvements within the existing web application, improving system boundaries, performance, and long-term maintainability
- Lead initiatives that strengthen production stability, performance, and reliability across the application layer
- Define and evolve runtime architecture patterns for server-side behavior, request handling, and scaling
- Improve deployment safety, release confidence, and environment consistency across product engineering
- Establish strong observability practices across the application stack, including logging, metrics, tracing, and debuggability
- Identify and address structural bottlenecks that slow development or introduce operational risk
- Partner with the API Platform team to help extract and separate API responsibilities from the application layer
- Collaborate with the UI Platform team on runtime performance, framework behavior, and production characteristics of the web stack
- Drive performance optimization efforts across the application tier, including latency reduction, scaling behavior, and resource efficiency
- Improve CI/CD architecture and operational maturity to support fast, safe iteration
- Mentor engineers and influence architectural direction across teams
Requirements
- 8+ years of experience building and operating large-scale production systems
- Strong hands-on experience with Node.js in production environments
- Strong proficiency with TypeScript in large, complex codebases
- Proven experience evolving real-world systems into stable, scalable platforms
- Deep understanding of system design, performance, and reliability at production scale
- Experience improving runtime stability, performance, and operational maturity of server-side systems
- Experience working across application and infrastructure layers
- Demonstrated ability to drive architectural change across teams without formal authority
- Strong experience with CI/CD systems, deployment workflows, and production operations
- Experience establishing or improving observability (logging, metrics, tracing) in production environments
- Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or equivalent practical experience
Bonus
- Experience with Next.js and/or React in large-scale production environments
- Experience evolving monolithic web applications into modular or service-oriented architectures
- Experience operating production workloads in Kubernetes-based environments
- Experience optimizing SSR performance and server-side runtime behavior
- Experience improving system performance under high request volume and growth
- Experience with Golang
About Together AI
Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancements such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers on our journey to build the next generation of AI infrastructure.
Compensation
We offer competitive compensation, startup equity, health insurance, and other competitive benefits. The US base salary range for this full-time position is $240,000 - $275,000, plus equity and benefits. Our salary ranges are determined by location, level, and role. Individual compensation will be determined by experience, skills, and job-related knowledge.
Equal Opportunity
Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.
Please see our privacy policy at https://www.together.ai/privacy