Job Description
Job Summary:
We are looking for a highly skilled and proactive System Engineer with deep expertise in Linux, Kubernetes, and networking. The role will primarily focus on providing technical solutions, troubleshooting, and ensuring performance and scalability across enterprise systems. You will play a key part in solving architectural challenges related to operating systems, network configurations, and cloud platforms while maintaining a high level of system efficiency.
Key Responsibilities:
1. Requirement/Design/Architecture
- Perform unified data analysis for requirements, monitoring, and evaluations to continuously enhance the system.
- Propose comprehensive technical approaches to address key architectural issues, especially in operating systems, Kubernetes, and networking configurations.
2. Deployment Design
- Contribute to the design of deployment strategies for new installations, updates, and upgrades.
- Ensure deployment designs align with enterprise standards to promote consistency and efficiency.
3. Total Cost of Ownership (TCO)
- Define test strategies for non-functional requirements (NFRs) and determine hardware and infrastructure requirements that are scalable and cost-effective.
- Devise resource-efficient infrastructure solutions that optimize performance and cost for the enterprise.
4. Technology Roadmap
- Identify and implement optimal technology solutions based on current requirements and future roadmaps.
- Conduct thorough technology evaluations to drive innovation and ensure alignment across product lines.
5. Knowledge Sharing
- Participate in Communities of Practice (CoPs) to facilitate knowledge sharing across teams.
- Promote best practices for platform and product usage across different sites, driving consistency in approach.
6. Tooling Strategy
- Identify and implement optimal tools and strategies for testing and evaluation, accelerating release cycles.
- Support unified tooling strategies for product evaluation and develop predictive models to assess feature changes.
7. Data Analysis and Feedback
- Analyze test and site feedback data to identify gaps and suggest improvements across the product portfolio.
Required Skills:
Linux
- Expert-level understanding of the Linux kernel, networking, vSwitch, and bridge networks.
- Proficiency in network connection analysis and the use of packet-sniffing tools.
- Real-time troubleshooting skills on production systems.
- Expertise in Linux profiling tools and techniques for optimizing performance.
Kubernetes
- Expertise in diagnosing and troubleshooting performance issues in Kubernetes hybrid clusters, particularly around networking and shared storage.
Windows
- Basic to intermediate knowledge of Windows networking, Open vSwitch, and bridge networks.
- Familiarity with Event Tracing for Windows (ETW) trace analysis for performance diagnostics in Windows environments.