Senior Manager – Site Reliability Engineering, Datacenter Hardware and IaaS

Posted 2025-04-18
Remote, USA; Part Time; Immediate Start
Description:
• GEICO is seeking an experienced Senior Manager with a passion for building high-performance, low-latency platforms and applications.
• You will build and manage a team of engineers with a deep focus on delivering enterprise-wide products that operate performantly and efficiently.
• The ideal candidate has the deep technical expertise to improve application performance, benchmark capacity, improve availability and reliability, and design and evolve cloud infrastructure and architecture.
• The Senior Manager will lead the strategy and execution of a technical roadmap that increases the velocity of product delivery and unlocks new engineering capabilities.

Requirements:
• Strong knowledge of modern at-scale datacenter architectures.
• Experience with OCP hardware and related technologies (e.g., OpenBMC, Redfish); knowledge of low-level driver development is a bonus.
• Focus on leveraging infrastructure as code as a primary means of control.
• Building CI/CD chains for datacenter operations
• Experience in building IaaS systems based on OpenStack
• Knowledge of cloud computing technologies and concepts (SaaS, PaaS, IaaS, etc.)
• Working knowledge of object-oriented development, Gang of Four (GoF) design patterns, microservices, dependency injection with IoC containers, and both frontend and backend unit testing
• Proven ability to concentrate and demonstrate a capacity for learning technical concepts and adapting to new technologies quickly
• Strong cloud platform knowledge (AWS, GCP, Azure, etc.)
• Proficiency with project management and work item management tools such as Azure DevOps and Portfolio
• Strong foundation in algorithms, data structures, and core computer science concepts
• Experience with existing operational portals such as Azure Portal
• Fluency with Python, Golang, JSON, and RESTful Web Services
• Experience with application monitoring tools and performance assessments
• Experience with PowerShell scripting
• Ability to construct, interpret, and apply metrics in your work and decision making, use them to identify correlations between drivers and results, and use that information to drive prioritization and action
• Strong understanding of Site Reliability Engineering and DevOps principles
• Strong technical acumen in Cloud Architecture, Performance Benchmarking, and Capacity planning
• Expertise in container orchestration (e.g., Kubernetes), container runtimes, and optimization
• Experience with driving cultural change in technical excellence, quality, and efficiency
• Experience managing and growing technical leaders and teams
• In-depth knowledge of CS data structures and algorithms

Benefits:
• Premier Medical, Dental and Vision Insurance with no waiting period
• Paid Vacation, Sick and Parental Leave
• 401(k) Plan
• Tuition Reimbursement
• Paid Training and Licensures
