Senior DevOps/OpenShift Platform Engineer for AI/ML workloads [Netherlands]


 

$ads={1}

About the Role
We are seeking a highly skilled and motivated Senior DevOps/OpenShift Platform Engineer for AI/ML workloads to join our dynamic team. As a Senior DevOps/OpenShift Platform Engineer, you will be responsible for the implementation, and maintenance of our on-prem Red Hat OpenShift Container Platform infrastructure tailored for AI/ML workloads.
You will collaborate with cross-functional teams across the organisation to support the development, deployment, and operations of our applications, services, and infrastructure.
You are self-motivated, proactive and thrive in taking ownership of your work. Your proactive nature enables you to identify opportunities for improvement, propose innovative solutions, and take the initiative to implement them.
Delivering high-quality results and committing to meeting deadlines are key contributors to the success of our team and organisation.
The AI/ML Platform squad is part of the Platform Tribe. Our squad delivers and operates on-prem Red Hat OpenShift Container Platform clusters tailored for AI/ML workloads.
As part of our commitment to Agile principles, we have embraced the Scrum framework to enhance productivity transparency, and adaptability in our activities. We value iterative and incremental development, which allows us to deliver value to our stakeholders while improving our processes.
Our internal processes are highly automated and focus on security, scalability, and operability. Our automation for infrastructure and application deployment follows the GitOps approach.
Key responsibilities:
  • Work with the Product Owner and Squad members to build and operate the SWIFT AI/ML Platform
  • Build scalable, secure, and highly available infrastructure using company and industry best practices
  • Develop and maintain automation for the deployment, configuration and ongoing management of the AI/ML Platform including components such as: OpenShift Container Platform, OpenShift Data Foundation, Kove software defined memory, Aqua Container Security, and InfiniBand network
  • Analyse system performance and make recommendations for improvements
  • Work closely with development teams to optimize and streamline the onboarding and CI/CD processes
  • Troubleshoot and resolve issues related to infrastructure, deployment, security and performance
  • Automate, build, test and release in accordance with the GitOps approach
  • Ensure the AI/ML Platform adheres to security and compliance standards
  • Implement and maintain monitoring solutions to ensure high availability and performance of the AI/ML Platform
  • Document platform architecture, processes, and procedures
  • Implement and maintain relevant mechanisms related to access controls, security policies and encryption mechanisms
  • Participate in meetings and Scrum ceremonies
  • Collaborate with development, operations, and other stakeholders to provide technical expertise and support
Technical skills and competencies:
  • Bachelors’ degree or masters’ degree in Computer Science, Engineering, or related field (or equivalent work experience)
  • Strong experience in implementing, and managing OpenShift (v4) platform infrastructure
    • UPI deployments experience desired or good understanding of IPI deployments
  • Experience in deploying and managing OpenShift clusters in a production environment
  • Experience with infrastructure as code and GitOps approach (e.g. ArgoCD, Git)
  • Good scripting and automation skills (Bash, Python)
  • Experience with monitoring tools (e.g. Prometheus, Elastic)
  • Experience with container security tools (e.g. Aqua Security, Open Policy Agent)
  • Experience in a programming language such as Java or Go
  • Knowledge of containerization technologies (Docker, Podman) – 3+ years
  • Knowledge of Linux – 3+ years
  • Familiarity with security best practices and compliance standards, benchmarks, and recommendations
  • Good problem-solving and troubleshooting skills
  • Strong communication and collaboration skills, team player and customer focused
  • Any of the following certifications are a strong asset:
    • Certified Kubernetes Security Specialist (CKS)
    • Certified Kubernetes Administrator (CKA)
    • Red Hat Certified Specialist in OpenShift Administration (EX280)
    • Red Hat Certified Engineer (RHCE)
    • Red Hat Certified System Administrator (RHCSA)
  • Fluent in English (spoken and written)
About the Platform Tribe:
The Platform Tribe aims to deliver highly secure, reliable, cost-effective and easy-to-use standardized/generalized private cloud infrastructure and platform services allowing our business teams to focus on value delivery to Swift customers.

What we offer
We put you in control of career
We give you a competitive package
We help you perform at your best
We help you make a difference
We give you the freedom to be yourself

We give you the freedom to be yourself. We are creating an environment of unique individuals – like you – with different perspectives on the financial industry and the world. An environment in which everyone’s voice counts and where you can reach your full potential regardless of
age, background, culture, colour, disability, gender, nationality, race, religion , sexual orientation, or veteran/military status.

$ads={2}


 

.

Post a Comment

Previous Post Next Post

Sponsored Ads

نموذج الاتصال