Senior Site Reliability Engineer (Remote) - Lightci
  • Hamilton, Ontario, Canada
  • via MindMatch.ai
-
Job Description

Role missionAs a Site Reliability Engineer (SRE) serving clients across multiple industries (including edtech, telecommunications, and more), you will work with cutting-edge technologies like AWS, ECS/EKS, and event-based systems, ensuring the reliability, scalability, and performance of our services. If you are passionate about solving complex challenges and making an impact through innovation and collaboration, we would love to hear from you.DeliverablesDesign, deploy, and maintain infrastructure using CDK in AWS environmentsDevelop monitoring solutions and implement incident response processes to ensure high availability and reliabilityImplement and manage containerized applications using ECS/EKSSupport various databases (RDBMS, NoSQL) ensuring optimal performance and reliabilityServe as an architect/AWS SME, lending your expertise to devs as they design scalable solutionsWork closely with development teams to ensure best practices in reliability and performance are followedWrite scripts to automate processes and improve efficiencyPerform DevOps tasks such as CI/CD pipeline management and configuration managementAbout youProficient in TypeScriptProven experience in an SRE role or similar, with hands-on expertise in CDK or Terraform, and AWSExperience managing containers using ECS/EKS in Fargate and EC2 clustersKnowledge of various databases (RDBMS, NoSQL) and their performance tuningExperience with event-based systems and event-sourcing methodologiesExperience with CI/CD pipelines and configuration management toolsStrong analytical and troubleshooting skills with a proactive approach to problem-solvingExcellent communication skills with the ability to collaborate effectively across teamsNice to haveExperience in either EdTech and Telco (preferably both)

J-18808-Ljbffr

;