Job Description
To give you a little more direction, the candidate for this position needs to be focused on monitoring and know several programming languages well enough to accomplish tasks for the Customer Experience Team.
This position is part of the Automation Flow Cell, the team that provides automated testing capabilities, designs and implements test platforms, and writes code to improve processes and increase efficiency through automation.
I would also like to stress that candidates should have a good working knowledge of Selenium, New Relic, and API implementations.
It's not the creation of APIs, but the integration of our code with outgoing connections to other applications' APIs. For instance, our code might automatically open ServiceNow tickets via that application's API, or trigger a PagerDuty alert via that application's API. The candidate must be able to write code that interacts with those APIs and others.
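As a rough illustration of the kind of outgoing integration described above, here is a minimal Python sketch that builds a PagerDuty Events API v2 trigger request and a ServiceNow Table API incident request. It is not our production code: the function names, routing key, instance URL, and credentials are hypothetical placeholders, and only the standard library is used.

```python
import base64
import json
import urllib.request

PAGERDUTY_EVENTS_URL = "https://events.pagerduty.com/v2/enqueue"


def build_pagerduty_trigger(routing_key, summary, source, severity="critical"):
    """Build (but do not send) a PagerDuty Events API v2 'trigger' request.

    routing_key is the integration key of a PagerDuty service (placeholder here).
    severity must be one of: critical, error, warning, info.
    """
    body = {
        "routing_key": routing_key,
        "event_action": "trigger",
        "payload": {"summary": summary, "source": source, "severity": severity},
    }
    return urllib.request.Request(
        PAGERDUTY_EVENTS_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def build_servicenow_incident(instance_url, user, password, short_description):
    """Build (but do not send) a ServiceNow Table API request that creates an incident.

    instance_url, user, and password are assumptions for illustration;
    basic auth is only one of the authentication options an instance may allow.
    """
    token = base64.b64encode(f"{user}:{password}".encode("utf-8")).decode("ascii")
    return urllib.request.Request(
        f"{instance_url.rstrip('/')}/api/now/table/incident",
        data=json.dumps({"short_description": short_description}).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Accept": "application/json",
            "Authorization": f"Basic {token}",
        },
        method="POST",
    )


def send(request, timeout=10):
    """Send a prepared request and return (HTTP status, decoded JSON body)."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.status, json.loads(response.read().decode("utf-8"))
```

Keeping request construction separate from `send` lets a monitoring script be tested without touching the live services; a real integration would also need retries and error handling for non-2xx responses.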
Determine the availability, scalability, security, and performance of the healthcare application platforms using third-party toolsets such as New Relic and DynaTrace.
Incorporate availability, monitoring, and capacity requirements into new and existing services so that they meet the availability and capacity needs of the business.
Identify gaps in current monitoring tools and areas for improvement, work towards delivering the needed tools, and, where required, provide guidelines to development teams on test scenarios to be included in testing.
Ensure all monitoring requirements are met, and carry out periodic reviews of the checks currently in place to ensure the service meets or exceeds customer expectations.
Proactively review and recommend changes to the live infrastructure after ensuring the right validation has been implemented.
Participate in software and system performance analysis and tuning, service capacity planning and demand forecasting.
REQUIRED Skills and Experience
2-3 years working knowledge of Linux operating systems and their underlying components, system statistics, performance tuning, file systems, and I/O, plus solid scripting skills in Perl and shell.
2-3 years working knowledge of networking and packet tracing, with an understanding of latency and throughput.
A Site Reliability Engineer is responsible for ensuring the reliability, availability, and performance of a company's systems and infrastructure.
As Site Reliability Engineers, we are responsible for finding solutions to severe technical problems, and we coordinate with others to overcome those problems quickly. We act as the company's software development experts. We evolve the software system to improve its reliability, and we use modern tooling to do so. We oversee the design, coding, and testing of software. We aim to deliver error-free systems that run fast and efficiently. We also train and manage the team working under us.
Core tasks: