Sr. Site Reliability Engineer

Roseland, NJ

Position Description:

The Site Reliability Engineer plays the mission-critical role of ensuring that critical systems are healthy, monitored, automated, and designed to scale. The position works closely with Project teams, Business Enablement teams, and other critical teams to run the approved project build in the field, to identify and promptly address issues (particularly in the production environment), and to support reliability initiatives that identify and resolve production issues.
This role is responsible for Availability Management, Latency Management, building and maintaining automation pipelines for cloud-native apps, onboarding applications to the Public Cloud, and analyzing cloud-based workloads.

The individual must have prior hands-on experience in the quick rollout of stable production releases and knowledge of cloud infrastructure and cloud applications. The Site Reliability Engineer is critical in ensuring that the operational phase of key cloud projects runs smoothly.

The key responsibilities of the Site Reliability Engineer include:
  • Responsible for creating, adapting and maintaining system configurations for applications which are running on cloud platforms
  • Understand business needs and translate those to cloud-native attributes like scalability, reliability, performance and on-demand service
  • Leverage industry-recognized software tools and methodologies and apply them to cloud-native use cases
  • Simplify the process of providing instances of the organization’s products by leveraging open source solutions and automation pipelines
  • Responsible for creation and deployment of detailed CloudFormation stacks throughout the product lifecycle
  • Documentation of newly deployed environments for ongoing operational support
  • Handover of knowledge, documents and IaC to the operations team for deployment of code to Production accounts
  • Deployment and troubleshooting of code in lower tier environments
  • Ability to quickly pick up and understand where newly released cloud services would be appropriate for business applications
  • Working with the business to provide different options to help influence the application team's outcome and deliver a superior product
  • Ability to manage both the business's requirements and costs while offering potential cost-saving methods

Technical Qualifications:

  • Experience in coding JSON/YAML
  • Knowledge of Jenkins/Ansible
  • Ability to diagnose and debug complex problems in production environments spanning public and private clouds
  • AWS container deployments including ECS and Fargate
  • KMS key creation and understanding of the service
  • Knowledge of IAM policy creation and troubleshooting
  • Understanding of AWS serverless architectures and services
  • Prior experience with AWS full stack creation pipeline including CodeCommit, CodeDeploy and CodePipeline
  • Understanding of Bitbucket and code review
  • Knowledge of AWS networking, Service Catalog, encryption methods and best practices
  • Knowledge of AWS Security Groups and their creation
  • Knowledge of CloudWatch logs and alarms and their creation
  • Understanding of Auto Scaling groups with SSL certificates, and of the AWS cost calculator
  • Technical understanding of RDS and deployment options
  • Knowledge of Secrets Manager and Parameter Store and basic AWS services (EC2 and S3)
  • Systems Manager automation documents
  • Understanding of AWS roles, including assumed roles
  • Strong understanding of developing and testing highly-available systems with appropriate considerations for disaster recovery
  • AWS CLI and Boto3
 

General Qualifications:

  • MS/BS degree in Computer Science or related technical field, or equivalent practical experience
  • Expertise in designing, analyzing and troubleshooting large-scale cloud-based environments
  • Ability to manage multiple projects at the same time
  • Working with external vendors to help drive solutions
  • A systematic problem-solving approach, coupled with a strong sense of ownership and drive
  • Excellent communication, presentation, influencing and reasoning skills