Staff Platform Engineer - Taikun & Kubernetes support - K8S, RKE and Longhorn

This role involves developing and maintaining Kubernetes-based infrastructure tooling within the Taikun platform, with a focus on K8s, RKE, and Longhorn support. The company works in hybrid data management and analytics, empowering organizations to build data-driven applications across multi-cloud and on-premises environments. As part of Cloudera's engineering team, you will do hands-on software development to enhance and sustain Kubernetes cluster lifecycle management, storage solutions, and cloud-native infrastructure capabilities.
Key Responsibilities:
- Design, develop, and maintain software components related to Kubernetes (K8s) cluster management within the Taikun platform.
- Work with RKE (Rancher Kubernetes Engine) to support cluster provisioning, upgrades, and lifecycle operations.
- Develop and maintain integrations with Longhorn for distributed block storage within Kubernetes environments.
- Collaborate with cross-functional engineering teams to deliver reliable, scalable, and production-grade infrastructure tooling.
- Troubleshoot and resolve complex issues related to Kubernetes infrastructure, storage, and networking.
- Contribute to architectural decisions and technical roadmaps for Kubernetes support features.
- Participate in code reviews, ensuring high code quality and adherence to engineering best practices.
Requirements:
- Strong hands-on experience with Kubernetes (K8s), including cluster administration and lifecycle management.
- Proficiency with RKE (Rancher Kubernetes Engine) for cluster provisioning and management.
- Experience with Longhorn or similar distributed storage solutions in Kubernetes environments.
- Solid software development skills, with experience in languages and tools relevant to cloud-native development (e.g., Go, Python, or similar).
- Understanding of cloud infrastructure concepts across major cloud providers and on-premises environments.
- Ability to work effectively in a collaborative, distributed engineering team.
- Strong problem-solving skills and the ability to debug complex distributed systems issues.
- Experience with CI/CD pipelines and DevOps practices is a plus.