View All Jobs 28183

Senior Implementation Specialist

Lead the end-to-end GPU-enabled Linux HPC deployment for a client and ensure successful on-site implementation
Texas, United States
Senior
22 hours agoBe an early applicant
Apex Systems

Apex Systems

Provides IT staffing, consulting, and workforce solutions, connecting organizations with technology talent and project-based services.

14 Similar Jobs at Apex Systems

Senior Implementation Specialist

We are seeking a Senior Implementation Specialist for a service delivery role. This position requires deep technical expertise in deploying and managing advanced server and networking hardware. The specialist will handle GPU deployment, system configuration, and performance testing in large-scale Linux environments, working directly with clients to ensure successful implementation and maintenance.

Key Responsibilities

  • Perform cluster-level code upgrades according to approved versions and compatibility guidelines.
  • Manage iDRAC, including configuration, access validation, health checks, troubleshooting, and lifecycle support.
  • Update server, BIOS, NIC, storage, and related firmware, ensuring version alignment and post-update validation.
  • Utilize Redfish APIs for system management, monitoring, customization, and automation.
  • Configure and manage BlueField DPUs.
  • Conduct GPU deployment, configuration, and multi-node testing using NVIDIA Base Command Manager.
  • Work in Linux-based parallel computing environments at scale.

Required Qualifications

Experience:

  • 7+ years of relevant experience with Red Hat distributions.
  • Deep hands-on experience with GPU deployment and configuration.
  • Experience with GenAI/HPC networking (InfiniBand and/or RoCE).
  • Experience with PowerEdge Rack/Tower hardware.
  • Strong customer-facing and communication skills.

Technical Skills:

  • Proficiency with benchmarking tools: HPL, STREAM, NCCL, RCCL, MxP, OSU Microbenchmarks.

Certifications:

  • NVIDIA certifications.
  • Red Hat certification (RHCSA/RHCE) is strongly preferred.

Preferred Qualifications

  • A Bachelor's degree.
  • Experience with PowerEdge XE servers and NVIDIA QR Switches.
  • Additional NVIDIA certifications (NCA, NCE, DGX).
  • Experience with NVIDIA UFM, Infiniband, and SpectrumX fabrics.
  • Exposure to hybrid cloud or GPU cloud environments.
  • Experience with GPU observability and performance profiling tools.

This employer is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or protected veteran status and will not be discriminated against on the basis of disability.

+ Show Original Job Post
























Senior Implementation Specialist
Texas, United States
Customer Success
About Apex Systems
Provides IT staffing, consulting, and workforce solutions, connecting organizations with technology talent and project-based services.