Reliability Engineer
Houston, TX
Full Time
Entry Level
Position Summary
We are seeking a proactive and results-oriented Reliability Engineer to join our Server R&D team. In this role, you will focus on ensuring the long-term reliability and performance of server and rack systems through rigorous validation, failure analysis, and cross-functional collaboration. The ideal candidate is experienced in reliability test methodologies and passionate about improving product quality from early development to mass production.
Key Responsibilities
- Plan and execute reliability validation strategies across system levels (Server/Storage/Rack), including thermal cycling, vibration, drop, operational stress, and HALT/HASS.
- Lead design reviews with EE, ME, Power, Thermal, and BIOS teams to identify and mitigate potential reliability risks early in the development cycle.
- Conduct system-level integration testing to validate hardware/software compatibility, stability, feature completeness, and long-term operational reliability.
- Perform root cause analysis (RCA) and drive corrective actions for reliability and quality issues; provide design feedback to improve future iterations.
- Define and maintain reliability test specifications aligned with industry standards (e.g., Telcordia GR-63, JEDEC JESD22, MIL-STD-810).
- Create and maintain test plans, procedures, and technical reports; present findings to internal stakeholders and external customers.
- Collaborate closely with global R&D centers (e.g., Taipei, Tianjin) and support customer audits and reviews as needed.
- Operate and calibrate reliability test equipment; ensure lab safety, equipment integrity, and data traceability.
Qualifications
- Bachelor’s or Master’s degree in Mechanical, Electrical, or Industrial Engineering, or a related technical field.
- Minimum of 2 years of experience in server, PC, notebook, or data center product testing or reliability engineering.
- Proficiency with Windows and Linux OS installation and basic command-line operations.
- Hands-on experience operating environmental and reliability testing equipment (e.g., thermal chambers, vibration/shock testers, power cycling systems).
- Strong analytical and problem-solving skills; familiarity with FA/RCA tools and debugging methods.
- Excellent verbal and written communication skills in English; Mandarin proficiency is a plus.
- Experience with external testing labs, certification processes, and documentation best practices is preferred.
Preferred Experience
- Knowledge of data center environmental standards and server architecture.
- Experience working with global customers and supporting OEM/ODM projects.
- Familiarity with failure analysis and experience working closely with design teams on corrective actions.