Job Title: Site Reliability Engineer
Location: San Jose, CA
Duration: Full Time
Job Description:
- Extensive experience working with Linux distributions such as RHEL and CentOS, including shells, filesystems, and utilities
- Knowledge of distributed computing and experience with container orchestration frameworks, including on-prem and Rancher Kubernetes, plus good knowledge of Kubernetes objects
- Experience working with storage (ONTAP preferred): volumes, aggregates, backups, DR planning
- Creating and supporting automation scripts (shell/Ansible/Python) for infrastructure deployments, validations, and monitoring to improve operational tasks
- Experience scheduling monitoring scripts using cron and Airflow
- Experience with monitoring tools such as Dynatrace, Apica, and Grafana
- Database knowledge, including SQL and NoSQL databases
- Experience building CI/CD pipelines (preferred)
- Cloud platform knowledge (specifically AWS) is required
- Incident handling and problem management
Job Type: Full-time
Work Location: In person