Google Play

IT Site Reliability Engineer

Publiée le 05.10.2022

Entreprise

Site Internet : https://about.gitlab.com/
Secteur d´activité :
Informatique, SSII, Internet
Recherche et développement
Offres d´emploi : Voir toutes nos annonces
GitLab is the DevOps platform that empowers organizations to maximize the overall return on software development by delivering software faster and... lire la suite

Détails de l'annonce

Poste proposé : IT Site Reliability Engineer

At GitLab, the IT Infrastructure team is responsible for Site Reliability Engineering for our tech stack applications and cloud infrastructure that supports corporate initiatives across many of our departments. In addition to traditional AWS and GCP administration, we also provide escalation engineering support for departments that manage their respective SaaS tech stack applications (vendor-hosted). Another of our functions is to provide DevOps Engineering for several internally built applications that power our business operations and automation.
The IT team collaborates closely with the Engineering Infrastructure Reliability team that is responsible for our GitLab.com SaaS platform (our product infrastructure). The IT, Engineering, and Infrastructure Security teams collaborate to architect, implement, and manage our AWS and GCP infrastructure policies and collectively manage all related services.
Responsibilities

  • Lead the handling of ticket queue (GitLab issues) for AWS and GCP corporate infrastructure requests from team members. This ranges from simple IAM and DNS requests to designing and deploying new scalable application infrastructure.
  • Design, build and maintain core infrastructure that enables GitLab can scale to support 2,000+ team members and the applications and services that they use day-to-day.
  • Implement and maintain system logging and monitoring to alert on problems and prevent outages, and get ahead of customer needs.
  • Maintain the corporate AWS and GCP infrastructure utilizing Ansible, Terraform, GitLab CI/CD, and Kubernetes
  • Gather and analyze operating system and application metrics to assist in performance tuning and fault finding
  • Create sustainable systems and services through patching, automation, and upgrades
  • Document every action so your findings turn into repeatable actions and then into automation.
  • Provide mentorship to IT System Administrators and IT Analysts who have an interest in infrastructure and IaC.
  • Collaborate with other teams to improve services and help with system design, platform management, and capacity planning
  • LevelsAWS and GCP - At least 2 years managing applications in AWS and/or GCP. An AWS and/or GCP professional certification is nice to have, however practical experience is more important in conjunction with Terraform experience for deploying applications and services using infrastructure-as-code with security best practices.
  • Security - Strong understanding of security best practices, network design, and how AWS/GCP roles should be used for IAM/RBAC least privilege.
  • Infrastructure-as-Code - Configuration management experience with Terraform and/or Ansible to effectively manage our infrastructure. Previous experience with AWS
  • CloudFormation, Chef, Pulumi, Puppet, etc. is acceptable, however strong Terraform experience is a requirement.
  • Kubernetes - Experience with managing Kubernetes clusters and using kubectl, k9s, etc for managing helm chart deployments, ingress services, and troubleshooting pods.
  • Previous experience with Docker and related technologies is acceptable since container concepts are transferable.
  • Operating Systems - Experience with managing Alpine, Debian, or Ubuntu Linux systems. We do not use Windows at GitLab. Many services are deployed in containers.
  • Cloud Services - Manage, configure and troubleshoot Linux operating system issues (Linux), storage (block and object), networking (VPCs, proxies and CDNs), and administer high-availability PostgreSQL and Redis clusters
  • Monitoring and instrumentation - Implement metrics in Prometheus, Grafana, Elastic, log management and related systems, and Slack/PagerDuty/Sentry integrations
  • Engineering practices - High availability, data security, reliability and scalability, as well as disaster recovery
Profil recherché pour le poste : IT Site Reliability Engineer

Requirements

  • 5+ years of experience in IT in a high growth Software as a service (SaaS) environment
  • Knowledge of configuration management tools like Ansible, Chef, or Terraform
  • Hands-on experience working in GCP and AWS environments
  • Experience working with CI/CD tools and Git
  • Ability to use GitLab
Critères de l'annonce pour le poste : IT Site Reliability Engineer
Métier :
Informatique, nouvelles technologies
Secteur d´activité :
Informatique, SSII, Internet
Recherche et développement
Type de contrat :
CDD
Région :
Boké - Conakry - Faranah - Kankan - Kindia - Labé - Mamou - Nzérékoré - International
Niveau d'expérience :
Expérience entre 5 ans et 10 ans
Niveau d'études :
Bac+3
Nombre de poste(s) : 4
Soyez vigilant ! N'envoyez pas d'argent à un employeur potentiel. Ne versez aucune somme d'argent en échange d'un contrat de travail potentiel ou pour suivre une formation préalable à l'embauche.
Merci de signaler toute irrégularité en utilisant le formulaire de contact candidat et en sélectionnant l'objet "Signaler une annonce d'emploi".

Partagez cette offre avec votre réseau

Autres offres d’emploi susceptibles de vous intéresser

IT Senior Internal Auditor

06/12/22 | SOLERA, INC.

The Role The Global Audit and Risk Services department is currently looking for an IT Senior Internal Auditor to document, review and evaluate internal controls over financial reporting (Sarbanes-Oxley, “SOX”) and conduct internal control reviews focused ...

Région de : Boké - Conakry - Faranah - Kankan - Kindia - Labé - Mamou...

ANGULAR Developer (M/F)

05/12/22 | AFRICASHORE

We are looking for ANGULAR Developer profiles / CVs to pass on to our clients for their future recruitment campaigns.   Mission:   Working within a Team to determine needs and applying / customizing existing technology to meet those needs. Producing detai ...

Région de : Boké - Conakry - Faranah - Kankan - Kindia - Labé - Mamou...

DevOps Engineer (M/F)

05/12/22 | AFRICASHORE

We are looking for DevOps Engineer profiles / CVs to pass on to our clients for their future recruitment campaigns.   Mission:   Ensure effective Design, Development, Validation and Support activities. Defining and setting development, test, release, depl ...

Région de : Boké - Conakry - Faranah - Kankan - Kindia - Labé - Mamou...

DOTNET Developer (M/F)

05/12/22 | AFRICASHORE

We are looking for DOTNET Developer profiles / CVs to pass on to our clients for their future recruitment campaigns.   Mission:   Working within a Team to determine needs and applying / customizing existing technology to meet those needs. Producing detail ...

Région de : Boké - Conakry - Faranah - Kankan - Kindia - Labé - Mamou...