Description
WHAT YOU DO AT AMD CHANGES EVERYTHING
At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
At AMD, we build products that power the future of high‑performance computing—from AI and data centers to gaming and embedded systems. Our culture values execution excellence, collaboration, and innovation. We believe meaningful progress comes from bold ideas and strong technical ownership. Join us as we shape the future of secure, scalable computing.
THE ROLE:
The Datacenter Graphics and Accelerated Computing Validation team is seeking a GPU Security / Firmware Validation Engineer to drive end‑to‑end validation of security‑critical GPU SoC features across pre‑silicon and post‑silicon environments. This role is hands‑on and highly technical, focused on silicon bring‑up, firmware enablement, system‑level validation, and debug for AMD's datacenter GPU platforms used in AI, Machine Learning, and High‑Performance Computing.
You will work closely with silicon design, firmware, driver, platform, and Product Security Office (PSO) teams to validate security, isolation, virtualization, RAS, and fuse‑related features from SoC through VBIOS, system firmware, and OS layers. This role plays a critical part in delivering secure, high‑quality GPU platforms to customers.
THE PERSON:
You are a self‑starter who thrives in fast‑paced, lab‑centric environments and can independently drive complex validation and debug efforts. You are comfortable leading technical discussions, mentoring junior engineers, and collaborating across teams to resolve challenging hardware and software issues. You bring strong problem‑solving skills, disciplined execution, and a passion for secure system design and validation.
KEY RESPONSIBILITIES:
- Own Datacenter GPU SoC post‑silicon security and firmware validation, spanning silicon, VBIOS, system firmware, drivers, and OS layers
- Drive pre‑silicon validation using emulation and simulation environments and transition coverage to post‑silicon platforms
- Develop and execute feature enablement and validation test plans for SoC‑ and system‑level security, virtualization, RAS, and fuse features
- Quickly learn new concepts and technologies as features and platforms evolve
- Lead post‑silicon debug efforts, performing system‑level root cause analysis across HW/FW/SW boundaries
- Build and maintain validation infrastructure, including software tools, automation, scripts, and lab setups
- Validate interactions between multiple GPU SoC features and subsystems in complex datacenter configurations
- Collaborate with silicon, firmware, driver, platform, and PSO teams to improve validation strategy, methodology, and coverage
- Drive technical innovation in security and RAS validation through tools, scripts, and process improvements
- Support customer platforms and engagements in collaboration with customer support and program teams
- Provide clear execution status, risk assessment, and issue updates to program management
- Mentor junior engineers and lead technical initiatives within the validation team
PREFERRED EXPERIENCE:
- 7+ years of experience in SoC validation, silicon bring‑up, or system‑level debug
- Strong background in post‑silicon validation and debug methodologies at SoC and system level
- Experience with security IP design/validation, fuse programming, and security feature enablement
- Strong programming and scripting skills (C/C++, Python; Perl a plus)
- Solid understanding of computer hardware architecture (GPU/CPU, x86, PCIe, memory, interconnects)
- Working knowledge of firmware, BIOS, drivers, and OS interactions
- Experience with Linux and/or Windows Server environments
- Hands‑on experience with lab equipment (logic analyzers, protocol analyzers, oscilloscopes, etc.)
- Strong communication skills and ability to operate effectively in cross‑functional, cross‑site teams
- Demonstrated experience in platform security, device security microarchitecture, or security-oriented definition of CPUs, SoCs, or other hardware
- Strong understanding of and experience with CPU confidential computing technologies (e.g., AMD SEV, Intel TDX, Arm CCA) and trusted execution environments (TEEs)
- Familiarity with confidential computing concepts such as remote attestation and sealing
- Expertise in operating systems and virtualization technologies, especially Linux kernel and driver-level development, hypervisors/virtual machines, and containers; experience with SaaS, PaaS, IaaS, and container systems such as Kubernetes is a plus
- Familiarity with typical software/hardware interfaces and driver techniques
- Experience designing and/or implementing secure systems, including threat modeling, security architecture, protocol design, network security, and operating system security
- Familiarity with and experience in datacenter interconnect standards and their underlying technologies and protocols
- Experience with confidential computing and isolation technologies (e.g., GPU/CPU secure modes, TEEs, attestation)
- Knowledge of GPU and PCIe virtualization technologies (SR‑IOV, SIOV, vGPU, MIG)
- Familiarity with RAS features, error injection, and reliability validation
- Experience with security threat modeling, vulnerability analysis, or SDL practices
- Understanding of cryptographic primitives and secure boot / root of trust concepts
- Exposure to PCIe and CXL standards and associated security considerations
- Experience supporting customer platforms and field issue debug
- Prior experience mentoring engineers or leading technical initiatives
ACADEMIC CREDENTIALS:
- Bachelor's or Master's degree in Electrical Engineering, Computer Engineering, Computer Science, or a related field
LOCATION:
Penang, Malaysia
#LI-CY
#LI-Hybrid
Benefits offered are described in AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD's “Responsible AI Policy” is available here.
This posting is for an existing vacancy.