Associate Director Software Engineering

Trimont · Hyderabad, Telangana, India

Full-time · Executive · Posted 1 month ago

Overview:

Founded in 1988 and headquartered in Atlanta, Georgia Trimont (www.trimont.com
[http://www.trimont.com]) is a specialized global commercial real estate loan
services provider and partner for lenders seeking the infrastructure and
capabilities needed to make informed, effective decisions related to the
deployment, management and administration of commercial real estate secured
credit.

We do this with a team of 1100+ extraordinary Team Members who serve a global
client base from offices in Atlanta, Bengaluru, Charlotte, Dallas, Hyderabad,
Kansas City, London, New York and Sydney. We empower our skilled global teams by
equipping them with the necessary knowledge and advanced technology, as well as
fostering a culture driven by values. This approach helps our teams excel and
build meaningful client relationships, while providing the highest quality
service and feeling proud of the work they do.

Trimont is an innovative firm where visionary professionals come to learn, grow,
and thrive with colleagues driven by curiosity and collaboration.

Learn: We believe ongoing learning is critical and are focused on providing a
work environment where individuals can take ownership of their careers.

Grow: We work alongside the largest institutional lenders in the world,
overseeing the most significant projects in the industry. This unique
opportunity allows us to broaden our skillset and develop our abilities by
tackling some of the industry's most challenging and exciting endeavors.

Thrive: Our firm is a place where ethics and excellence meet to create an
experience that matches our capabilities. There are no limits to what you as an
individual, and we as an organization, can achieve together.

Position Overview: The Production Support Engineer is a vital frontline role
focused on maintaining the stability, performance, and availability of Trimont’s
live, business‑critical applications and systems. This role acts as the primary
technical bridge between development and operations, ensuring uninterrupted
service for end users and business stakeholders.

The position provides advanced operational support for applications hosted on
Microsoft Azure, including Azure App Services, Virtual Machines, batch
scheduling (AutoSys), and streaming platforms (Confluent Kafka). The role
requires strong expertise in incident management, application debugging,
proactive monitoring, and root cause analysis, particularly within banking and
financial services environments operating under strict SLAs and regulatory
controls.

Summary:

We are seeking an experienced Production Support Engineer to support enterprise
banking and financial services platforms in a highly available, regulated
production environment. This is a hands‑on operational role that requires strong
troubleshooting and debugging skills, especially for Azure App Services, and the
ability to own production incidents end‑to‑end.

The ideal candidate is comfortable working under pressure, responding rapidly to
incidents, driving long‑term stability improvements, and collaborating closely
with development, DevOps, infrastructure, and business teams.

Responsibilities:

 

Incident Management & Troubleshooting

* Act as L2/L3 production support for enterprise banking and financial services
applications.
* Monitor systems proactively using tools such as Splunk, AppDynamics, Azure
Monitor, etc., to detect and prevent potential issues.
* Respond immediately to alerts, incidents, and outages to minimize business
impact.
* Debug and resolve complex production issues across applications, databases,
infrastructure, and integrations.
* Troubleshoot Azure App Services issues including crashes, performance
degradation, configuration errors, deployment failures, and connectivity
problems.
* Implement break‑fixes and immediate corrective actions to restore services
quickly.

Problem Prevention & System Stability

* Perform thorough Root Cause Analysis (RCA) for recurring and high‑impact
incidents.
* Collaborate with development teams to implement permanent fixes and prevent
incident recurrence.
* Identify stability risks and drive proactive improvements.
* Develop and implement automation scripts to reduce manual effort and improve
operational efficiency.

Azure, Batch & Integration Support

* Troubleshoot applications hosted on Azure App Services and Azure Virtual
Machines.
* Analyze logs, metrics, and telemetry to diagnose issues.
* Support AutoSys batch job scheduling, dependency failures, reruns, and
recovery scenarios.
* Monitor and troubleshoot Confluent Kafka platforms, including topics,
producers, consumers, offsets, and latency issues.
* Support REST APIs, integrations, and data flows across enterprise systems.

System Maintenance & Change Management

* Perform routine system health checks, maintenance, and operational
validations.
* Support safe deployment of application releases, patches, and configuration
changes.
* Adhere strictly to Change Management, Incident, and Problem Management
processes.
* Participate in release readiness reviews and production deployments.

Documentation & Communication

* Maintain accurate runbooks, SOPs, incident reports, and operational
documentation.
* Communicate clearly with business users, leadership, and technical teams
during incidents.
* Provide status updates, post‑incident summaries, and lessons learned.
* Participate in 24x7 on‑call support on a rotational basis for critical
systems.
* Mentor junior support engineers and share operational best practices.

Banking Domain, Security & Compliance

* Operate within banking and financial services environments, adhering to
security, audit, and regulatory standards.
* Support Business Continuity Planning (BCP) and Disaster Recovery testing
activities.

 

 

Requirements:

 

* Bachelor’s degree in Computer Science, Engineering, Information Systems, or
related field.
* 7+ years of experience in Production Support / Application Support roles.
* Strong hands‑on experience debugging Azure App Services in production.
* Solid experience supporting applications on Azure Virtual Machines.
* Strong knowledge of Linux/Unix commands and administration.
* Strong SQL skills and experience troubleshooting SQL Server, Oracle, or
similar databases.
* Experience with monitoring tools such as Splunk, AppDynamics, AutoSys, Azure
Monitor, etc.
* Hands‑on experience with AutoSys workload scheduling.
* Experience with Confluent Kafka or similar messaging platforms.
* Strong understanding of banking / financial services domain and production
SLAs.
* Experience with Incident, Problem, and Change Management methodologies.
* Excellent analytical, troubleshooting, and communication skills.
* Ability to work effectively under pressure in 24x7 production environments.

 

Trimont is an equal opportunity employer, and we’re proud to support and
celebrate diversity in the workplace. If you have a disability and need
accommodation or assistance with the application process and/or using our
website, please contact us. Trimont is a drug-free workplace.

Sign up to apply