about the company.
We provide critical safety and performance layers for next-generation LLM applications, ensuring seamless integration of AI into global business ecosystems.

about the team.
We focus on building the "operating system" for AI services—solving high-concurrency challenges, optimizing distributed architectures, and setting the gold standard for service reliability at a global scale.

responsibilities.
-Design and refine the end-to-end service operating model for high-traffic AI ecosystems.
-Define system behaviors under extreme load, implementing advanced failure handling, timeouts, and degradation strategies.
- Lead technical initiatives to strengthen the AI and API layers, eliminating structural single points of failure.
- Coordinate cross-component decisions and translate complex design patterns into reusable frameworks and high-quality documentation.
-Drive hands-on implementation while influencing technical direction across various engineering modules.

skills and experience required.
-7+ years in backend or platform engineering with a proven track record in designing scalable, high-traffic services. ...

ai sre associate director.

job details

share this job.

get in touch

梁靖 Stephanie Liang

let similar jobs come to you