Platform Reliability Engineer

Software Engineering

icon type Full Time

icon location Ho Chi Minh

LARION, a global software outsourcing partner with nearly 2 decades deep industry expertise. We are a 100% Vietnam technology company specializing in turnkey solutions and building highly skilled development teams for companies of all size and types of business. Run by a team of successful entrepreneurs and dedicated technical experts – LARION is a unique symphony where we create a frictionless future for customers with passion, while maintaining full compliance with your needs and objectives.

I. What You’ll Do

We are seeking a Platform Reliability Engineer, you will play a key role in managing and optimizing the operational aspects of the server and network infrastructure for a large financial buy-side organization. Your primary focus will be on reducing operational overhead, optimizing systems, managing configurations, and ensuring the reliability and performance of critical infrastructure.

Responsibilities:

  • Ensure the production reliability of the firm Linux-based research and trading platform as part of a globally distributed engineering team.
  • Provide rapid emergency response to production infrastructure issues.
  • Proactively understand internal client needs and effectively communicate them to leadership at both regional and global levels.
  • Identify risks, develop contingency plans, and implement solutions to mitigate them.
  • Develop and enhance the observability platform to monitor the performance and health of critical computing environments.
  • Participate in occasional (monthly) on-call rotations and support on-call staff during their shifts.
  • Contribute to organizational knowledge through documentation, education, and writing maintainable code.

II. What You’ll Bring

  • 2+ years of experience in SRE, DevOps, or other infrastructure engineering roles, preferably within the financial industry.
  • Strong understanding of Linux system internals, including kernel operations, memory management, and performance optimization.
  • In-depth knowledge of storage technologies, particularly those used in high-performance computing (GPFS experience is a plus).
  • Broad understanding of IT infrastructure components, such as networking, DNS, NTP/PTP, and NIS.
  • Proficiency in system automation, monitoring, and self-healing (experience with Salt is a plus).
  • Experience with container orchestration and virtualization technologies (e.g., Kubernetes, Nomad, VMware).
  • Familiarity with on-premises and cloud-based HPC infrastructure (operational knowledge of Slurm and GPU is a plus).
  • Understanding of AI technologies and their applications in infrastructure automation and management.
  • Experience with or a strong interest in implementing AI/ML solutions for infrastructure optimization, anomaly detection, or predictive analytics.
  • A passion for technology and automation, with a deep sense of curiosity and ownership.
  • A hands-on approach to problem-solving and a demonstrable enthusiasm for technology.
  • Excellent verbal and written English communication skills.

III. Why We’ll Love Working Here

1. Workplace

  • Join a vibrant, young and dynamic team working on cutting-edge projects & emerging technologies.
  • Collaborate with global experts & top tech talent to enhance your skills.
  • Thrive in a culture of openness, forward-thinking and innovation-driven team while encouraging your full potential.

2. Benefits Comprise

  • Competitive salary, 13th month salary and attractive performance bonuses.
  • Flexible hybrid working model with WFH 2 days per week.
  • Premium Healthcare and Accident insurance.
  • Annual health check package.
  • Free parking and allowances: Lunch, Marriage, Newborn baby, Bereavement and others applied.
  • A spacious pantry that is fully equipped with coffee maker, fridge, microwave and more for your most comfortable lunch time.
  • A wide range of sport and social activities: Yoga, Football, Badminton, Tech clubs, etc.
  • Annual company trip and teambuilding.
  • Chance to be honored quarterly and annually with recognition awards for individuals, teams, long-term service, etc.
  • Advanced English and appropriate soft skills training to assist your career development.
  • Engaging monthly events: Happy Gathering, Mini Game, Team Birthday Celebrations, Company’s Year-end party, etc.
  • Exclusive company supporting funds to ease your personal loans of Home, Vehicle, Tuition, etc.

IV. Additional information

  • Location: QTSC Building 1, Quang Trung Software City, Trung My Tay Ward (District 12), Ho Chi Minh City
  • Working Time: 8:30 AM – 6:00 PM from Monday to Friday (Lunch break between 12:00 PM to 1:30 PM)

Gentle notice to our candidate about Decree No.13/2023/NĐ-CP

According to Decree No.13/2023/NĐ-CP on protecting personal data, LARION would apply “Personal Data Processing Agreement” with all candidates to ensure compliance with the decree.

By submitting this application to LARION, you agree to allow LARION to proceed your provided information in accordance with “Personal Data Processing Agreement” that you have read, fully understood and agreed to the entire content at link https://larion.com/privacy-policy/

Career

Accepted file types: pdf, doc, docx, Max. file size: 10 MB.