Filled Positions

Software Reliability Engineer (Remote)

Are you looking to hire?

Thankz offers a range of outstanding Software Reliability Engineer (Remote) candidates. If you're searching for top talent in this field or a similar position, our team can find the ideal person who meets your specific needs and requirements.

As a Software Reliability Engineer, you will play a crucial role in ensuring the reliability, stability, and performance of our software systems. Your expertise in reliability engineering, troubleshooting, and automation will be instrumental in maintaining high-quality software delivery and customer satisfaction. 

What you'll be doing 

  • Designing and implementing reliability strategies and processes for our software systems 
  • Conducting reliability assessments and analyzing system performance to identify areas for improvement 
  • Collaborating with cross-functional teams to define and enforce reliability standards and best practices 
  • Developing and maintaining automated monitoring, alerting, and incident response systems 
  • Conducting root cause analysis and implementing corrective actions for system failures and performance issues
  • Building and maintaining tools for performance testing, load balancing, and capacity planning 
  • Participating in code and architecture reviews to ensure reliability and scalability 
  • Mentoring and guiding development teams on reliability engineering principles and best practices 
  • Staying up to date with emerging technologies and industry trends in software reliability engineering 

Requirements 

  • Bachelor's degree in Computer Science, Engineering, or a related field 
  • Proven experience as a Software Reliability Engineer or in a similar role
  • C1/C2 English proficiency (both written and spoken)
  • Strong understanding of reliability engineering principles and practices 
  • Proficiency in troubleshooting and performance analysis of complex software systems 
  • Experience with automation and scripting for system monitoring and incident response
  • Familiarity with cloud platforms and technologies (e.g., AWS, Azure, GCP) 
  • Knowledge of programming languages like Java, Python, or Go 
  • Excellent problem-solving and analytical skills 
  • Strong collaboration abilities 

Preferred candidates are highly skilled Software Reliability Engineers with a deep understanding of reliability engineering principles and practices. They should have a proven track record in troubleshooting and performance analysis of complex software systems. Experience in implementing automated monitoring and incident response systems, as well as proficiency in cloud platforms and programming languages, is highly valued. Strong problem-solving skills, excellent communication, and the ability to work effectively in a remote environment are key attributes we seek.

We offer a full-time remote position on US hours (40-hour workweek, Monday to Friday) with excellent prospects for long-term growth for an ambitious, experienced Software Reliability Engineer. We can offer HMO and other benefits to Philippine candidates.