bigshyft
AAppViewX
AppViewX
Lead Site Reliability Engineer
Series B
Start-up
201-500 employees
7y - 13y

(Competitive pay)

Coimbatore
Linux, AWS, Ansible, Kubernetes, DevOps

Role

Company

Job Description

What you'll do:

  • Provide leadership and guidance, act as subject matter expert, and foster the use of best practices.
  • Oversee the involvement of the SRE team in the SDLC to ensure the performance, scalability, and reliability of our services.
  • Work with engineering, QA, and other teams in the architecture and implementation of Internet-scale services.
  • Create and review requirements specifications; evaluate solutions and designs, assess implementations,
  • Lead the SRE projects and tasks including OAM tasks with onus on optimization.
  • Coach and mentor the SRE team to improve their knowledge and expertise, and the quality engineering deliverables, and provide recommendations to ensure the dependability of our services.
  • Build the automation for large scale OAM of our systems and services.
  • Prevent incidents that could impair the operational readiness of our systems and services.
  • Set up logging and monitoring systems that alert on symptoms instead of outages.
  • Improve the performance, scalability, reliability, quality, and time-to-market of our suite of software solutions.
  • Improve our operational processes (deployment, onboarding, upgrades, decommission, etc.) with automation reducing to a minimum while supporting (rare) human intervention.
  • Assist with the architecture and implementation of Internet-scale services.
  • Support the production SaaS environment by monitoring key performance indicators and taking a holistic view of system health.
  • Debug production issues across all tiers, layers, and components of our applications and services.
  • Support of the production SaaS environment by monitoring key performance indicators and taking a holistic view of system health.
  • Be responsible for documenting cases to reflect the actions taken, informing customers of problem status and providing updates and solution(s) in professional and timely fashion, over the lifetime of the support request.
  • Take ownership of customer issues when escalated by customer management. Drive to resolve issues effectively, escalating cases to development teams where necessary.


What makes you a great fit:

  • 7+ years of experienced covering DevOps, Software Development, SRE.
  • Experienced in Linux System Administration and Networking.
  • Strong programming skills: Linux Shell and Python.
  • Experienced with configurations management systems/tools (Ansible), infrastructure as code (CloudFormation, Helm, Terraform), version control (Git), and CI/CD (Jenkins).
  • Solid experience in Cloud Computing (AWS preferably, GCP, and Azure)
  • Solid experience in container technology (Docker, containerd), container orchestration (Kubernetes), service meshes (Istio), observability (Prometheus, Jaeger), visualization (Grafana), event logging and alert management (Splunk, Elasticsearch)
  • Strong understanding of IT and Security best practices, controls, regulations, standards, tools, etc.
  • Data mining experience
  • Strong understanding of the WWW architecture and technologies,
  • Strong understanding of DBs (SQL and NO SQL).
  • Experienced in the delivery of IaaS, and/or PaaS, and/or SaaS.
  • Project management background and experienced leading large technology implementations.
  • Intense dedication to the performance, scalability, and reliability of applications and systems.
  • Significant experience in DevOps and SRE adoption and evolving practices.
  • Quick, effective, efficient.
  • Strong leadership skills.
  • Strong communicator, able to lead and facilitate discussions across many tiers including business, architecture/design, engineering, DevOps, operations, etc.
  • Experienced in 24x7 Operations.
  • Exposure to Azure and other cloud platforms
  • Experience in technical engineering / design of SaaS environments is a Plus.
  • Experience in CI-CD technologies such as Jenkins is a Plus.
  • Hands on coding abilities in Python and Terraform is a Plus.
All about us
AppViewX

The world runs on applications. “Yes, we have an app for that” is now part of our daily lexicon whether as consumers or employees. So, while application availability, access, security and compliance are critical to our user experience and productivity, why do we continue to see application downtime? The answer is simple…too many cooks in the proverbial kitchen. DevOps teams are doing this. NetOps teams are doing that. SecOps teams are doing something different. Even though all of them have one essential goal: to ensure application access and availability, security and compliance. Enterprise organizations depend on this to deliver reliable, resilient, and secure applications to support customers, business growth and revenue goals. Is there a better way? Yes! The AppViewX Platform aligns cross functional teams with self-service workflow orchestration to accelerate and optimize application delivery with security and compliance built-in. With role-based access controls and self-servicing functionality, teams can work in concert with each other throughout the application delivery process to provision and manage digital identities at scale, configure and control application infrastructure processes, and create and enforce policies that meet internal and external compliance requirements. While other vendor tools just automate tasks, we unite enterprise development, IT and InfoSec teams to eliminate manual processes and errors, reduce service tickets from days to minutes, streamline secure application delivery and facilitate digital transformation from one central control plane, The AppViewX Platform!

Employee count
201-500 employees
Employment Type
Full Time Job
Company Type
Start-up
Headquarters
New York City, New York, United States
Perks & facilities
Option to 'work from home'
Pick-up/drop facility
Health care benefits

Apply to Similar Jobs

  • AAppViewX
    AppViewX
    SRE Lead
    Series B
    Start-up
    201-500 employees
    8y - 10y
    ₹25 - ₹35 LPA
    Coimbatore
    Ansible, Terraform, AWS, Kubernetes, Linux