POD Test Engineer - IB4
Houston, TX
Full Time
Mid Level
POD Test Engineer
Position Summary
The POD Test Engineer is responsible for supporting, maintaining, and improving server rack test operations within the POD environment. This role ensures test readiness, production stability, failure analysis, process optimization, and continuous improvement of system-level testing for AI and Data Center products. The engineer serves as the technical interface between Manufacturing, Quality, Facilities, Software, and Customer Engineering teams to achieve production and quality targets.
Key Responsibilities
Production Test Support
Position Summary
The POD Test Engineer is responsible for supporting, maintaining, and improving server rack test operations within the POD environment. This role ensures test readiness, production stability, failure analysis, process optimization, and continuous improvement of system-level testing for AI and Data Center products. The engineer serves as the technical interface between Manufacturing, Quality, Facilities, Software, and Customer Engineering teams to achieve production and quality targets.
Key Responsibilities
Production Test Support
- Support daily POD operations and ensure maximum test station uptime.
- Monitor production performance, throughput, cycle time, and first pass yield (FPY).
- Provide technical support for AST, FLA, FCT, FC2, IOT, IST, FPF, NVL, ORT, and other validation processes.
- Ensure test systems, racks, fixtures, and infrastructure remain operational and available.
- Perform root cause analysis of test failures.
- Debug hardware, software, network, and infrastructure-related issues.
- Analyze logs, diagnostic reports, BMC data, and test results.
- Drive corrective and preventive actions (CAPA) to improve yield and reliability.
- Support customer escalations and provide technical updates.
- Identify opportunities to improve test coverage, cycle time, and production efficiency.
- Develop and implement automation solutions for reporting, monitoring, and diagnostics.
- Drive continuous improvement initiatives focused on quality and operational excellence.
- Participate in DOE activities and process validation.
- Support NPI builds, engineering validation, and manufacturing ramp activities.
- Validate test procedures, software releases, fixtures, and station configurations.
- Collaborate with cross-functional teams to ensure manufacturing readiness.
- Create and maintain SOPs, work instructions, troubleshooting guides, and engineering reports.
- Generate daily, weekly, and monthly performance reports.
- Track key metrics including FPY, APY, LPY, cycle time, downtime, and failure paretos.
- Present technical findings to management and customers.
- Train technicians and junior engineers on test procedures and troubleshooting methodologies.
- Provide technical leadership during critical production events.
- Support staffing, shift handovers, and escalation processes.
- Bachelor's degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, Computer Science, or related field.
- 3+ years of experience in Manufacturing Test Engineering, System Validation, NPI, or Data Center operations.
- Experience with server hardware, networking, Linux environments, and manufacturing test systems.
- Strong troubleshooting and root cause analysis skills.
- Experience with Python, Bash, PowerShell, or automation tools is preferred.
- Knowledge of manufacturing processes, quality systems, and failure analysis methodologies.
- Experience with AI servers, GPU platforms, liquid cooling systems, or high-performance computing products.
- Experience with NVIDIA, Data Center, Rack Integration, or System Level Testing.
- Familiarity with SFC, MES, BMC, IPMI, PXE boot, and network infrastructure.
- Knowledge of statistical analysis, yield improvement, and process control methodologies.
- First Pass Yield (FPY)
- Average Pass Yield (APY)
- Test Station Uptime
- Production Throughput
- Cycle Time Reduction
- Escaped Defects
- Customer Satisfaction
- Corrective Action Closure Rate
- Fast-paced manufacturing and data center environment.
- Support of multiple shifts, including occasional off-hours and weekend activities.
- Interaction with customers, suppliers, and cross-functional engineering teams.
- Ability to work around server racks, power distribution systems, and liquid cooling infrastructure.
Apply for this position
Required*