Reliability Engineer

Houston, TX
Full Time
Entry Level

Position Summary

We are seeking a proactive and results-oriented Reliability Engineer to join our Server R&D team. In this role, you will focus on ensuring the long-term reliability and performance of server and rack systems through rigorous validation, failure analysis, and cross-functional collaboration. The ideal candidate is experienced in reliability test methodologies and passionate about improving product quality from early development to mass production.

Key Responsibilities

  • Plan and execute reliability validation strategies across system levels (Server/Storage/Rack), including thermal cycling, vibration, drop, operational stress, and HALT/HASS.
  • Lead design reviews with EE, ME, Power, Thermal, and BIOS teams to identify and mitigate potential reliability risks early in the development cycle.
  • Conduct system-level integration testing to validate hardware/software compatibility, stability, feature completeness, and long-term operational reliability.
  • Perform root cause analysis (RCA) and corrective actions for reliability and quality issues; provide design feedback to improve future iterations.
  • Define and maintain reliability test specifications aligned with industry standards (e.g., Telcordia, GR-63, JESD22, MIL-STD-810).
  • Create and maintain test plans, procedures, and technical reports; present findings to internal stakeholders and external customers.
  • Collaborate closely with global R&D centers (e.g., Taipei, Tianjin) and support customer audits and reviews as needed.
  • Operate and calibrate reliability test equipment; ensure lab safety, equipment integrity, and data traceability.

Qualifications

  • Bachelor’s or Master’s degree in Mechanical, Electrical, or Industrial Engineering, or a related technical field.
  • Minimum of 2 years of experience in server, PC, notebook, or data center product testing or reliability engineering.
  • Proficiency with Windows and Linux OS installation and basic command-line operations.
  • Hands-on experience operating environmental and reliability testing equipment (e.g., thermal chambers, vibration/shock testers, power cycling systems).
  • Strong analytical and problem-solving skills; familiarity with FA/RCA tools and debugging methods.
  • Excellent verbal and written communication skills in English; Mandarin proficiency is a plus.
  • Experience with external testing labs, certification processes, and documentation best practices is preferred.

Preferred Experience

  • Knowledge of data center environmental standards and server architecture.
  • Experience working with global customers and supporting OEM/ODM projects.
  • Familiarity with failure analysis and working closely with design teams for corrective actions.
Share

Apply for this position

Required*
Apply with Indeed
We've received your resume. Click here to update it.
Attach resume as .pdf, .doc, .docx, .odt, .txt, or .rtf (limit 5MB) or Paste resume

Paste your resume here or Attach resume file

To comply with government Equal Employment Opportunity and/or Affirmative Action reporting regulations, we are requesting (but NOT requiring) that you enter this personal data. This information will not be used in connection with any employment decisions, and will be used solely as permitted by state and federal law. Your voluntary cooperation would be appreciated. Learn more.
Human Check*