Job Description

Experience Level: Experienced Hire

Categories:

  • Engineering & Technology

Location(s):

  • Quay Building 8th Floor, Bagmane Tech Park, Bengaluru, IN

Moody’s has a developmental culture in which we value candidates who are willing to grow. So, if you are excited about this opportunity but don’t meet every single requirement, please apply! You may be a perfect fit for this role or other open roles.

Moody's is a global integrated risk assessment firm that empowers organizations to make better decisions.

At Moody’s, we’re taking action. We’re hiring diverse talent and providing underrepresented groups with equitable opportunities in their careers. We’re educating, empowering and elevating our people, and creating a workplace where each person can be their true self, reach their full potential and thrive on every level. Learn more about our DE&I initiatives and employee development programs, and view our annual DE&I Report, at moodys.com/diversity.

Job Description

In this role, you will be part of the global production operations team within Moody’s Rating Technology and will be responsible for implementing site reliability practices. The candidate should become proficient in the applications and platforms they manage and in their interdependencies, develop mechanisms for troubleshooting, and apply technical know-how and problem-solving skills to keep systems running smoothly.


This role demands a deep understanding of log analysis to pinpoint and resolve issues in production systems. It also involves debugging tasks, code reviews, and the implementation of logging standards in applications. Experience in log analysis and troubleshooting within Java or .NET frameworks is a must. A comprehensive grasp of business operations, applications, integrations, vendors, products, services, systems, and workflows is crucial.


The role requires experience in production support tasks, including monitoring, alerting, application performance management, user experience, incident handling, and root cause analysis. The incumbent will understand business functionality; apply monitoring, automation, and DevOps practices; and collaborate with cross-functional teams to resolve complex problems and enhance application services.

Job Function and Responsibilities

  • Develop a deep understanding of the product supported, business function and technical architecture, down to the application code and data
  • Perform in-depth log analysis to identify and diagnose issues in production systems
  • Use monitoring tools such as Splunk, Datadog, Grafana, and CloudWatch to quickly zero in on application and infrastructure issues
  • Collaborate with product teams by providing technical findings from production incidents and assisting in determining the root cause and resolution
  • Assist efforts to resolve high-impact incidents by providing technical direction on the triage call and working with business, application, and other technical teams
  • Develop logging standards and identify areas of improvement in monitoring, application stability, and speed of determining root causes
  • Drive initiatives to improve efficiency and shorten issue detection and resolution times
  • Identify opportunities for automation and relentlessly champion the reduction of manual and repetitive tasks
  • Ensure quality and timely communication is maintained with business and technology stakeholders on critical issues
  • Develop excellent working relationships with teammates and stakeholders across business and technology
  • Be proactive and laser-focused on execution, and promote a culture of continuous improvement

Qualifications

Minimum education and work experience required for this position include:

  • BS degree in Information Systems, Computer Science, Computer Engineering or equivalent
  • 7+ years of solid work experience in IT and Application Support / Technology Operations
  • Deep, hands-on experience with log analysis and root cause identification
  • Sound experience with monitoring tools such as AppDynamics, Grafana, Datadog, Splunk, or CloudWatch, and the ability to configure alerts and dashboards
  • Knowledge of .NET / Java and AWS cloud-native application development, along with knowledge of database technologies such as PostgreSQL, Oracle, or Sybase, is required
  • Knowledge of AWS or comparable cloud hosting technologies
  • Ability to troubleshoot cloud-based systems and Linux and to coordinate with technical SMEs
  • A strong sense of urgency for high-severity incidents, with the ability to assess customer impact and provide tactical solutions
  • Good understanding of distributed systems architecture, including database, middleware, server, and container-based infrastructure, is a plus
  • Knowledge of Python and scripting is a plus
  • Some experience with GitHub and Jenkins is desired
  • Hands-on experience with incident and problem management
  • Excellent problem-solving skills
  • Excellent verbal and written communication skills



Moody’s is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability, protected veteran status, sexual orientation, gender expression, gender identity or any other characteristic protected by law.

Candidates for Moody's Corporation may be asked to disclose securities holdings pursuant to Moody’s Policy for Securities Trading and the requirements of the position. Employment is contingent upon compliance with the Policy, including remediation of positions in those holdings as necessary.

For more information on the Securities Trading Program, please refer to the STP Quick Reference guide on ComplianceNet.

Please note: STP categories are assigned by the hiring teams and are subject to change over the course of an employee’s tenure with Moody’s.

Application Instructions

Please click on the link below to apply for this position. A new window will open and direct you to apply at our corporate careers page. We look forward to hearing from you!

Apply Online