Job Description
Job Title: DevOps Engineer
Employment type: Contract
Location: Salisbury, NC
Work ethic - You are a consummate professional.
Aptitude - You have an innate capacity to transition from project to project without skipping a beat.
Communication - You have excellent written and verbal communication skills for coordination across projects and teams.
Impact - You are a critical thinker with an emphasis on creativity and innovation.
Passion - You have the drive to succeed paired with a continuous hunger to learn.
Leadership - You are trusted, empathetic, and accountable, and you empower others around you.
Responsibilities
• Teaching peers about monitoring & observability best practices.
• Guiding & reinforcing proper use of our toolset to improve the quality, reliability & availability of the services our teams offer.
• Implementing and enhancing monitoring of the hardware & software across our ecosystem.
o Developing and improving instrumentation and integrations.
o Providing guidance on monitoring best practices.
o Providing guidance on monitoring specific hardware & software items (key points to monitor).
• Implementing and enhancing observability of products & platforms across our ecosystem.
o Developing and improving instrumentation.
o Providing guidance on key areas to observe.
o Educating teams on how observability tools work.
• Ensuring our internal customers have the best monitoring & observability possible, to help them raise the quality, reliability & availability of corporate IT infrastructure.
• Scripting / Infrastructure as Code / Process Creation for monitoring & observability implementations & enhancements to lower overhead & improve efficiency.
Requirements:
• Experience with Monitoring / Observability / Site Reliability Engineering
• Engineering degree or equivalent experience and familiarity with engineering best practices.
• Working knowledge of how hardware & software interact in a corporate retail environment.
• Experience with Azure / Azure DevOps
• In-depth knowledge of one or more of the following hardware/software domains:
o Application Servers (IIS, Tomcat, WebSphere, JBoss, etc.)
o Containerization / Virtualization (Kubernetes, VMware, etc.)
o Database (SQL Server, Postgres, DB2, Oracle, etc.)
o Message Bus (IBM MQ, Kafka, ActiveMQ, RabbitMQ)
o Networking (Cisco ACI, F5 Load Balancers, Firewalls, etc.)
o Operating Systems (Red Hat, Windows, etc.)
o Programming (Java, .NET, Python, etc.)
o Storage Devices
o Web Servers (Apache, nginx, etc.)
• Familiarity with the Agile Scrum process.
• Ability to interact with a variety of personalities and technical skill levels across multiple product & platform teams.
• Proficiency in developing and maintaining technical documentation.
Nice to haves:
• Experience with:
o Datadog
o Nagios
o ServiceNow Event Management / Service Operations Workspace
• Knowledge of the Google Site Reliability Engineering model
• Experience with Infrastructure as Code / Configuration Management tools:
o Terraform
o Ansible
o Azure DevOps
• Skills in troubleshooting production environments (this is not a day-to-day responsibility of this role, but the experience will prove valuable as we build the tools those teams use).
• Strong ownership attitude / track record of taking responsibility.
Job Tags
Permanent employment, Contract work