InfraOps Lead

Share and work together with your friends !


• Diploma/Degree in Computer Science/Software Engineering/Engineering/Mathematics or related discipline.
• 5+ years of experience in enterprise on-cloud and/or on-premise infrastructure with multidisciplinary technical skills in virtualisation, containers, networks, storage & backup, security, firewalls, database administration and change management. Experience with hybrid architectures would be advantageous.
• With 2 to 4 years of hands-on experience in administering complex cloud-based applications involving services such as Kubernetes, OpenShift, AWS EKS and other cloud-native services such as AWS S3, Route 53, lambda, Kinesis, Kafka, IAM, SNS, SQS, KMS, CloudFormation…etc.
• With 1 to 3 years of experience with comprehensive assessment, development, implementation, and documentation of security processes, application security, data protection, cryptography, key management, identity and access management (IAM), and network security within SaaS, IaaS, PaaS, and other cloud-based environments.
• Strong understanding of cloud computing, including the various cloud deployment models, cloud service models, and cloud deployment architectures on Amazon Web Services (AWS) and Aliyun. Experience with other China-based solutions such as Tencent Cloud would be advantageous.
• Strong knowledge of Enterprise Firewalls (including Web Application Firewalls), Unified Threat Management, Web Filtering, Email Security, Two Factor Authentication, Site2Site IPSec VPN and other security related principals and technology would be advantageous.
• Familiar with ITIL and best practices in service management; certification in ITIL is a plus
• Knowledge in one or more of the following tools: various CI/CD tools, Jenkins, Docker, Ansible, CloudFormation, HashiCorp Terraform, Chef and/or Puppet would be advantageous.
• Ideal candidates possess one or more cloud computing certifications such as AWS Certified Solutions Architect, CompTIA Cloud+, Certified Cloud Security Professional (CCSP), Certified SysOps Administrator, Oracle Cloud Infrastructure Certified Architect Professional, AWS Certified DevOps Engineer or related certifications.
• Strong analytical, reasoning, and problem-solving skills with an ability to anticipate outcomes of a solution.
• Meticulous in analysis and documentation of new learnings and findings
• Maintain confidentiality of information processed or prepared.
• Strong communication and collaboration skills
• Self-driven and perform duties and responsibilities independently with minimum supervision
• Possess positive attitude and interpersonal skills and project a professional image
• Fluent in speaking, reading and typing in both English and Mandarin
• Proficiency in basic SQL queries is an advantage


• Work closely with clients and application teams to advise on best practices and optimal cloud-based architecture for their needs
• Work closely with clients and application teams to create an automated CI/CD pipeline and/or DevOps and change management processes
• Perform routine administration duties, such as patching, backup and restore, upgrades, configuration management, change management, and performance optimisation & tuning of multiple cloud systems on mainly Amazon Web Services (AWS) and Alibaba Cloud (Aliyun).
• Setup, manage and guide infra team to ensure daily support, maintenance, troubleshooting, infra stability, optimization and infra related tasks are fulfilled.
• Configuring dashboards, monitoring tools and threshold alerts to observe system health from the perspective of network, bandwidth, buffer, CPU, memory, storage, application responsiveness, instruction and take pre-emptive action to resolve of escalate the issues detected to ensure minimal customer impact.
• Provision new virtual machines, containers, database services, SaaS/PaaS services, as an when necessary while maintaining optimal cost-benefit and maintain SLA and expected uptime.
• Ensures compliance of architectural and engineering policies, standards and procedures and ensure that changes, configurations, transitions, and migrations controls are followed, documented and managed.
• Assist with application configuration, change management, deployments, and migrations from time-to-time when necessary.
• Provide timely Level 2/3 operations and technical support to the Level 1 Customer Service team
• Finding solutions from previous issues and educating the Level 1 Customer Service team
• Contribute pro-actively to improve response time, resolution time and reduce number of incidents
• Responding, investigating, resolving, communicating root cause analysis and recommending solutions in a timely manner
• Actively monitor the usage patterns of the suite of applications and pre-emptively identify abnormal behaviours and respond accordingly • Ensure infrastructure is stable and cost is optimized at all time
• Occasional light design and programming work relating to scripting and database queries
• Establishing a timeline and protocol for harder-to-solve problems and communicating troubleshooting analysis to stakeholders for resolution
• Compiling and providing a comprehensive handover of daily task to the team member on duty in subsequent shifts
• Obtain feedback and suggestions for product improvements

Like this job ? Share It Now & Work Together With Your Friends !