Description
WHAT YOU DO AT AMD CHANGES EVERYTHING
At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
Senior Staff Software Engineer - AI Agentic Infrastructure & Systems
THE ROLE:
At AMD, we are redefining the paradigm of low-level system software development. We are seeking a Senior Staff Software Engineer to develop the deep integration of high-autonomy agents (e.g., Claude Code, Cursor) into our system software development workflow.
In this role, you will architect an automated, closed-loop system - from requirement description to engineering task verification. By leveraging Verification-Driven Engineering and Feasibility Analysis to determine solvability within existing frameworks, while enabling the Agent to autonomously acquire and synthesize new skills through iterative self-learning, you will transform complex low-level engineering challenges into predictable, controlled agentic tasks, while architecting autonomous agents capable of independent problem-solving and self-evolving technical skills.
KEY RESPONSIBILITIES:
1. Architect Verification-Driven Agentic Workflows
- Multi-Agent Collaboration: Implement strategies involving specialized roles (e.g., infra-Architect, Debug-Coder, QA-Validator) to ensure high-quality engineering task output and minimize hallucinations.
- Domain-Knowledge Centric RAG: Build high-precision retrieval systems using LangChain (LCEL) to index massive repositories, PDFs, and Confluence pages, utilizing advanced strategies like Parent Document Retrieval and Semantic Chunking.
- Complex State Machines: Design and implement cyclic, multi-step reasoning architectures using LangGraph to manage long-running coding tasks and "reflection" loops.
2. Autonomous Execution & Self-Correction
- Zero-Touch Provisioning: Develop systems where agents autonomously set up sandboxed runtimes, resolve dependencies, and configure infrastructure.
- Autonomous Test Synthesis: Architect engines that generate edge-case reproduction scripts and validate fixes within isolated CI/CD pipelines.
- Self-Healing Remediation: Engineer loops that enable agents to parse execution logs, identify root causes, and iteratively apply patches until tests pass.
3. Benchmarking & Optimization
- Performance Evaluation: Lead the evaluation of agentic performance using industry-standard benchmarks (e.g., SWE-bench), aiming for top-tier recovery rates.
- Trace Analysis: Utilize LangSmith for deep trace analysis, debugging complex agent trajectories, and optimizing prompt/chain latency and cost.
PREFERRED EXPERIENCE:
- AI Agent Architecture: Proficient in architecting autonomous AI agents using LangGraph, AutoGen, and LangChain. Proven experience in building self-correcting engineering workflows and validating performance via benchmarks like SWE-bench.
- System Programming Excellence: Deep experience in C/C++, with expert knowledge of Linux, memory management, and interrupt handling. Familiar with modern software development process, including complex CI/CD pipelines.
- MCP and Skills Development: Experience in custom MCP Servers and Skills.
- AI Developer Insight: Advanced user of AI tools (Cursor, Claude Code) or developer of LLM-based agentic plugins. Deep understanding of Prompt Engineering and debugging strategies for non-deterministic systems.
- Engineering Philosophy: Strong belief in "Verification as the Boundary." Ability to decompose complex NP-level engineering problems into automatically verifiable P-level tasks.
NICE TO HAVE:
- Hardware Verification: Hands-on experience with Board Bring-up and proficiency with low-level diagnostic tools such as JTAG, xbutil, and dmesg.
- Experience with AMD Vitis or AIE programming.
- Compiler Background: Proficiency with semantic analysis
- Familiarity with computational complexity theory and its application to software efficiency
ACADEMIC CREDENTIALS:
- Master's or PhD in Electrical Engineering, Computer Science, or related field.
LOCATION:
- San Jose
This role is not eligible for visa sponsorship.
#LI-GW1
#LI-HYBRID
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD's “Responsible AI Policy” is available here.
This posting is for an existing vacancy.
Apply on company website