JOB TITLE: Associate Site Reliability Engineer
FLSA STATUS: Exempt
DEPARTMENT: Technology
REPORTS TO: Director, Information Technology
SUPERVISORY RESPONSIBILITIES: No
JOB LOCATION: Nashville, TN – Corporate Office/Remote
TRAVEL: None
ESSENTIAL DUTIES & RESPONSIBILITIES:
- Monitor the health and performance of production systems and infrastructure.
- Respond to and investigate incidents to ensure timely resolution and minimize downtime.
- Develop and maintain monitoring, alerting, and automation tools to improve system reliability and efficiency.
- Collaborate with software engineers to design and implement scalable and reliable solutions.
- Participate in capacity planning and performance tuning activities to support business growth.
- Contribute to the design and implementation of disaster recovery and failover strategies.
- Document processes, procedures, and best practices to facilitate knowledge sharing and training.
- Stay current with industry trends and best practices in site reliability engineering and related technologies.
- Serve as a primary point of contact for all infrastructure supporting our enterprise applications.
- Engage and assist our security team to provision and maintain a secure environment for our applications.
- Execute quarterly security scans and work with security and development teams to resolve findings.
- Create and maintain a portfolio of templates, scripts, and configurations for developers to use when building out new environments or applications.
- Work with our vendors to help maintain our dedicated environment.
MINIMUM QUALIFICATIONS (EDUCATION AND EXPERIENCE):
- 1-3 years’ experience in Site Reliability Engineering or a similar engineering role
- Practical experience with device and application monitoring tools (e.g., Site24x7, New Relic, CloudWatch, GuardDuty)
- Experience using configuration management tools such as Chef or Puppet
- Strong sense of ownership over your work
- Determination to follow issues to resolution both with and without guidance
- Practical experience with programming languages such as Go, Python, or JavaScript/TypeScript
PREFERRED QUALIFICATIONS (EDUCATION AND EXPERIENCE):
- Experience migrating applications from dedicated environments to cloud-based providers
- AWS certification (Solutions Architect, SysOps Administrator, DevOps Engineer)
- Experience setting up and securely configuring web servers such as Apache, Nginx, and IIS
- Experience creating and maintaining pipelines within CI/CD tools such as Azure DevOps, AWS CodeDeploy, or Bitbucket
- Knowledge of creating and maintaining containerized applications