Site Reliability / DevOps Engineer
ITTConnect
Paranã, TO - há 3 horas
Descrição do trabalho
ITTConnect is seeking a Site Reliability Engineer / DevOps, with experience in Java applications to work remotely for a client in the US. This is a position with a global leader in consulting, digital transformation, technology and engineering services present in nearly 50 countries. The end client is in the Telecom area (internet, mobile, cable TV provider). Initial assignment until December.

Job location: Remote, work from home anywhere in Brazil, following US Eastern time.

All interviews will be in English only (must be professionally fluent).

Responsibilities:
  • Production support: Triage of production systems and understanding of analysis and how to drive toward resolution while participating in large group discussions.
  • Perform impact analysis on infrastructure and applications.
  • Identify, prepare, execute mitigation plans.
  • Perform production deployments either manually or through automation.
  • Perform required deployment verifications after application or services post deployments.
  • Take deep dives in Java code to identify possible fixes for production issues.
  • Troubleshooting production issues and driving the bridges
  • Work as a contributing team member together with other team members in other states and countries.


Requirements:
  • At least 5 years’ experience in supporting JAVA Application / services hosted in Linux environments.


Kubernetes:
  • Ability to perform cluster level administration on K8s platform.
  • Creating and maintaining scripts to maintain, monitor and alerts on K8s platform.
  • Comfortable with kubectl / YAML


Docker:
  • Understanding of docker.
  • Good experience in writing docker files.
  • Creating images, maintaining docker registry.


Ansible:
  • Playbook creation for repeatable tasks
  • Perform installation of software (platform and code deployment)
  • Take documentation and create roles to install software.
  • Create reusable roles and playbooks
  • CloudFormation and Terraform


Monitoring Tools:
  • New Relic - APM, Insights, Infrastructure
  • Ability to create alerts and dashboards.
  • Splunk - Querying and dashboard creation


Application Performance Tuning:
  • Participate in load testing and resolve testing bottlenecks.
  • Java heap and thread dump analysis


Jenkins:
  • Knowledge and understanding of Jenkins Pipelines
  • Ability to analyze console job log for errors.


OS Support and Troubleshooting:
  • Redhat Enterprise Linux
  • Amazon Linux
  • Alpine Linux
  • Windows 2012/2016


Nice to have tools & technologies:
  • Kubernetes Platform: Rancher, GCP
  • Hashicorp Tools: Vault, Consul
  • Middleware: JBoss EAP, JBoss Fuse
  • Elastic Search – implementation and troubleshooting
  • AWS Services: EC2, Lambda, RDS, VPC, IAM Roles, DynamoDB
Trabalhos semelhantes que você ainda não viu:
Site Reliability / DevOps Engineer
ITTConnect, Paranã, TO

ITTConnect is seeking a Site Reliability Engineer / DevOps, with experience in Java applications to work remotely for a client in the US. This is a position with a global leader in consulting, digital transformation, technology and engineering services present in nearly 50 countries. The end client is in the Telecom area (internet, mobile, cable TV provider). Initial assignment until...

DevOps - SRE (Site Reliability Engineering)
Encora Inc., Paranã, TO

Description Important Information Location: Brazil Job Mode: Full-time Work Mode: Work from home Job Summary Site Reliability Engineering (SRE) is a discipline that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems. Responsibilities and Duties Utilize...

DevOps - SRE (Site Reliability Engineering)
Encora Inc., Paranã, TO

Description Important Information Location: Brazil Job Mode: Full-time Work Mode: Work from home Job Summary Site Reliability Engineering (SRE) is a discipline that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems. Responsibilities and Duties Utilize...

DevOps - SRE (Site Reliability Engineering)
Encora Inc., Paranã, TO

Description Important Information Location: Brazil Job Mode: Full-time Work Mode: Work from home Job Summary Site Reliability Engineering (SRE) is a discipline that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems. Responsibilities and Duties Utilize...

DevOps - SRE (Site Reliability Engineering)
Encora Inc., Paranã, TO

Description Important Information Location: Brazil Job Mode: Full-time Work Mode: Work from home Job Summary Site Reliability Engineering (SRE) is a discipline that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems. Responsibilities and Duties Utilize...

Site Reliability Engineer / DevOps
Gazin Tech, Paranã, TO

Descrição do Cargo Estamos em busca de um Site Reliability Engineer / DevOps para se juntar à equipe da Gazin Tech. Neste cargo, você será responsável por garantir a confiabilidade e a estabilidade dos sistemas através da automação, monitoramento e resolução de problemas. As tarefas diárias incluem desenvolver e manter infraestrutura, solucionar problemas de sistema e colaborar com equipes de...

Site Reliability Engineer II
Meta, Paranã, TO

O que procuramos?Atribuições principais:Prover guia técnico e mentor técnico outros engenheiros;Participar do desenvolvimento e manutenção da infraestrutura Cloud;Colaborar nas decisões técnicas envolvendo arquitetura e infraestrutura (dimensionamento de carga, distribuição de carga, estratégias para cache, etc.);Ser responsável por monitoramento e observabilidade dentro dos clusters...

Senior Site Reliability Engineer (SRE)
Meta, Paranã, TO

O que procuramos?Atribuições principais: Prover guia técnico e mentor técnico outros engenheiros;Participar do desenvolvimento e manutenção da infraestrutura Cloud;Colaborar nas decisões técnicas envolvendo arquitetura e infraestrutura (dimensionamento de carga, distribuição de carga, estratégias para cache, etc.);Ser responsável por monitoramento e observabilidade dentro dos clusters...