Sr. HPC Application Performance Engineer - Hybrid/Telecommute
Company: Varada Consulting
Location: Mountain View
Posted on: July 31, 2022
Job Description:
Sr. HPC Application Performance Engineer
Clearance: Ability to obtain a Public Trust
Job Location: Hybrid-onsite (Mountain View, CA) OR Remote, US
Overview: Varada Consulting proudly supports NASA's High Performance
Computing Services program in Mountain View, CA at the Ames
Research Center and in Greenbelt, MD at Goddard Space Flight
Center. Make a DIFFERENCE on a program that supports 4 on-site
supercomputers with 18,000+ nodes and 17+ petaflops of combined
capacity.
We have an immediate position for a Senior HPC Application
Performance Engineer to join our Application Performance and
Productivity Group (APP) team. This role supports the hundreds of
infrastructure systems that support HPC operations. As a member of
the APP group, an individual at this skill level should have
demonstrated experience working with users of HPC resources on the
various issues they might have getting applications to run
efficiently. The abilities to interact with users to determine
symptoms and then reproduce their issues in order to isolate
the causes are critical skills for this work. After a period of
acclimatization, the person filling this role would be expected to
assume Point of Contact (POC) duties on a regular schedule. Besides
the POC role, there will be activities in testing, benchmarking,
user tool scripting, and analyzing trouble tickets to find patterns
indicating system or user education issues. A significant portion
of the job duties will be doing performance analysis and
optimization of user applications.
Responsibilities:
- Consults with and supports our user base on the most challenging
  aspects of scientific computing.
- Acts as one of several escalation points for the group's Tier 2
  Points of Contact (POC) in handling the group's level-2 trouble
  tickets. Issues can include:
  - Porting, scaling, optimization, and debugging of application
    codes and runtime libraries
  - Assisting new users with issues such as logging in and setting
    up their environments
  - Helping users with job scripts, including parallelization when
    indicated
- Using applications and benchmarks, conducts system testing, both
  proactively and in response to requests.
- Designs and develops scripts for the automation of testing and
  benchmarking.
- Designs and develops scripts to improve user productivity and
  system utilization.
- Works with users to understand their current and future
  requirements for applications and workflows.
- Evaluates tools for improving user workflow productivity and
  application performance.
- Monitors trouble ticket trends and develops knowledgebase articles
  and training modules to address common issues.
Requirements:
- Master's degree (or equivalent experience) in computer science,
  science, engineering, or related field
- 5 years of HPC-specific software optimization and consulting
  experience (not including as a user of HPC)
- Experience with performance analysis tools
- Experience with Message Passing Interface (MPI) programming
- Experience working with HPC systems
- Experience working with HPC applications and proficiency in at
  least one of C, C++, or Fortran
- Superior scripting skills and excellent attention to detail;
  proficiency in at least one of Python, Perl, or Bash
- Good organization skills to balance and prioritize work, and
  ability to multitask
- Ability to work in a hybrid remote/onsite team environment
- Strong ability to interact with customers to understand needs,
  elicit requirements, and get feedback on prototype solutions
- Excellent communication and people skills, time management, and
  organizational skills
Preferred Skills:
- Proficiency at technical writing
- Proficiency with analysis and problem-solving skills for debugging
  and optimization of applications
- Proficiency working with HPC systems and scheduling software (PBS,
  Slurm, or Moab/Torque)
- Proficiency with C, C++, Fortran, Perl, Python, Bash
- Proficiency with OpenMP
- Proficiency with revision control software (e.g., CVS, SVN, Git)
- Experience with HPC, Lustre, and InfiniBand
Join an Award-Winning Team! Voted as Most Innovative and Fastest
Growing Company, Varada Consulting offers highly customized IT
capabilities in the federal civilian and DoD market space in
support of the mission objectives of the federal government. Varada
provides competitive compensation and benefits packages, including
100% employer-paid healthcare premiums.
Varada Consulting, LLC is an Equal Opportunity Employer. All
qualified applicants will receive consideration for employment
without regard to race, color, religion, sex, sexual orientation,
gender identity, national origin, disability, or veteran
status.