Overview
On Site
Depends on Experience
Contract - W2
Contract - 12 Month(s)
Skills
C
C++
BASH
AWK
PERL
PYTHON
HPC
Job Details
Job Title: Cloud Platform Architect/HPC Systems Architect
Duration: 12 Months
Location: Mountain View, CA (Hybrid)
Job Overview
We are seeking an experienced Cloud Platform/HPC Systems Architect to lead our organization's strategic direction in cloud adoption and high-performance computing (HPC) systems. This role involves designing and implementing scalable cloud and HPC solutions, monitoring system performance, and supporting advanced computing environments. The architect will drive cloud strategy, ensure optimal performance of HPC resources, and provide technical guidance to our infrastructure team.
Duties and Responsibilities
- Cloud and HPC Strategy Development: Provide strategic direction for the organization's cloud adoption, drive design, and oversee the implementation of cloud and HPC solutions to meet organizational objectives.
- System Support & Optimization: Install, integrate, and manage high-performance computer systems, clusters, operating systems, peripherals, and interfaces; monitor usage to maintain optimal system performance and reliability.
- Configuration & Tuning: Configure and tune batch queuing systems in parallel production environments; gather and analyze system utilization statistics to identify and resolve issues.
- Technical Collaboration: Work with computational professionals and users to evaluate requirements and configure/deploy cloud infrastructure tailored to meet specific needs.
- System Analysis & Problem Solving: Independently address complex issues, analyze operational needs, and devise integrated solutions for cloud, security, and hardware/software implementations.
- Trends & Future Proofing: Maintain awareness of trends in cloud computing and HPC, ensuring scalable, future-proof architecture for on-premises and cloud-based clusters.
- Other Duties: Perform miscellaneous job-related tasks as required.
Minimum Job Requirements
- Education: Bachelor's degree in a related field.
- Experience: 5-7 years of experience in cloud and HPC system design, with strategic leadership experience in medium to large organizations.
- Cloud & HPC Skills:
  - Proven expertise in designing, implementing, and supporting IT cloud solutions (AWS or Google Cloud Platform preferred).
  - Knowledge of Infrastructure as Code (IaC), containerization, and configuration management tools.
  - Strong familiarity with HPC systems, scalable parallel architectures, and Linux OS.
- Technical Proficiency:
  - Advanced understanding of data storage technologies, high-speed network interfaces, and hybrid cloud environments.
  - Proficiency in high-level programming languages (e.g., C, C++, Fortran).
  - Working knowledge of scripting languages (e.g., csh, Bash, Awk, Perl, Python).
  - Experience with installation and configuration of operating systems and applications.
  - Skilled in complex problem resolution, system testing, and evaluation methods.
- Collaboration & Communication: Excellent communication skills and ability to work effectively within technical and business teams, facilitating requirements gathering and planning.
- Analytical Skills: Ability to interpret complex issues and operational needs, providing creative, integrated solutions.
Soft Skills
- Independent Judgment & Problem Solving: Demonstrated ability to use independent judgment to address complex tasks and resolve issues.
- Team Collaboration: Proven experience working within IT infrastructure teams, collaborating on large projects, and providing mentorship to junior HPC engineers and technical staff.
- Project Involvement: Participate in IT infrastructure team activities to deploy, configure, and manage large parallel compute, storage, and system software components, primarily within cloud environments.
- Adaptability & Leadership: Ability to guide the organization in technology decisions and mentor team members in HPC systems engineering.
Preferred Qualifications
- Experience in Cloud-First Design: Ability to gather requirements, plan, and design cloud-first solutions.
- Industry Awareness: Deep understanding of current trends in cloud and HPC systems, ensuring scalable architecture aligned with the latest technologies.