
Observability Systems Engineer

Location Gurgaon, India
Posted 07-September-2021
Description
The Server Reliability Engineering division is responsible for providing global, robust and innovative trading and research compute platforms to support Tower's trading teams. As an Observability Engineer, you will be an integral member of the team, with responsibilities that include prototyping, designing, developing and supporting highly scalable monitoring, logging, alerting and data analytics platforms based on major open-source solutions.


Responsibilities

Be a part of a team of Server Reliability Engineers with a vision of achieving truly scalable monitoring, logging, alerting and data analytics platforms.
Design and implement solutions that enable multiple teams to efficiently extract insights from data. This includes ingestion (web scrapes, FTP sync, sensor collection, etc.), transformations (MySQL, Kafka, Python/C/Java, etc.) and interfaces (API, schema design, events, etc.).
Develop and manage monitoring, alerting and logging platforms that help SRE and Operations teams quickly pinpoint, isolate and resolve issues related to infrastructure, platform services and applications
Build tools and automation capabilities for data pipelines that improve the efficiency, quality and resiliency of our data platform
Communicate effectively with the DevOps managers on release milestones, sprints and roadmap activities with respect to Observability Engineering
Follow best practices for scale, performance, geo-distribution, multi-cloud, code maintenance, documentation, etc.
Support on-call rotations for the team as per the business requirements


Qualifications

Early-career experience (2-7 years) in systems engineering, observability or system administration
Observability and tooling instrumentation experience (log collection, ETL, visualization, tooling and integration)
Demonstrable understanding of DevOps concepts (CI/CD, build automation, microservices, infrastructure as code, source control)
Demonstrable basic programming skills in at least Python and Bash
Solid understanding of OS concepts and internals of Linux
Hands-on experience with Kubernetes or another container orchestrator
Demonstrable debugging and troubleshooting skills
Formal higher education in a relevant field (software engineering, IT, CompSci, math, data sciences and others)


Additional Qualifications (Preferred):

Experience in Kubernetes as a systems engineer (deployment, troubleshooting, maintenance, Helm charts)
Deployment and administration of one or more of:
ELK stack
Kafka
Prometheus
Grafana

Working knowledge of at least one cloud platform (GCP, AWS or Azure)
Working knowledge of some configuration management system (such as Salt or Ansible)
Database administration knowledge
Good understanding of networking concepts (architecture, components, protocols)
Experience
2 to 7 years

 