Job ID: 04-284

Description: Provides expertise in integrating, installing, configuring, upgrading, compiling, and supporting COTS/GOTS software in a heterogeneous operating system environment.

The Level III Software Integration Engineer (SIE) shall possess the following capabilities:

  • Ability to integrate, install, configure, upgrade, compile, and support COTS/GOTS software.
  • Ability to generate documentation for the full software stack.
  • Ability to update software for sustainment support.
  • Basic Linux system administration and shell scripting skills.
  • Ability to execute test codes to characterize software performance.
  • Ability to provide software product ownership for HPC tools.
  • Working knowledge of CM tools, web documentation, and issue tracking.
  • Ability to work in a fast-paced environment and switch between various architectural paradigms.

SIE-3 Qualifications: Bachelor’s Degree in Computer Science or a related field and at least eight (8) years of demonstrable experience integrating, installing, configuring, upgrading, compiling, and supporting COTS/GOTS software in a heterogeneous operating system environment.

OR

Five (5) years of full-time work experience directly related to Computer Science may be substituted for a degree; the individual shall also have at least eight (8) years of demonstrable experience.

OR

An industry-recognized professional certification, as defined in the TT0s, may substitute for one (1) year of experience. A Master’s Degree in Computer Science or a related field may substitute for two (2) years of experience.

Position Description

The Software Integration Engineer shall provide systems integration expertise in installing, integrating, configuring, monitoring, and sustaining various COTS/GOTS/FOSS software, packages, libraries, and storage systems in an HPC Linux environment. 

Position Required Skills

  • Experience using the Linux CLI
  • Experience performing system administration tasks, including installation, configuration, and sustainment of various COTS/GOTS software in a large-scale HPC Linux environment
  • General HPC technical knowledge regarding compute, network, memory, and storage components
  • Experience deploying and sustaining distributed file systems such as Lustre, GPFS, and HDFS
  • Experience with Infrastructure as Code (IaC) concepts, principles, best practices, and automation tools, including SaltStack
  • Experience with Commercial-Off-The-Shelf (COTS) software frameworks for searching, monitoring, and analyzing big data, such as Splunk
  • Experience with open source monitoring, visualization, and alerting technologies, including the Prometheus metrics collector and Grafana dashboarding

Position Desired Skills

  • Familiarity with network communication technologies such as IP and InfiniBand (IB)
  • Familiarity with Site Reliability Engineering (SRE) principles and concepts
  • Experience scripting with Python in a Linux environment
  • Experience developing test plans, procedures, and reports
  • Experience with the Atlassian Tool Suite (JIRA, Confluence)
