Staff Engineer - Platform Engineering
ThoughtSpot
Job Description
About the Role
We are looking for a Staff Engineer with 9+ years of experience to join our Cloud Platform Engineering team and take deep ownership of critical platform systems within a multi-tenant SaaS environment.
This role sits at the intersection of backend engineering and cloud infrastructure, focused on building and operating scalable, reliable systems powering both the control plane and data plane.
As a senior technical leader, you will drive key architectural decisions, lead high-impact platform initiatives, and mentor engineers, all while remaining hands-on in solving complex engineering problems at scale.
What You Will Do
Architecture & Platform Design
- Lead the architecture and design of core platform subsystems across control plane and data plane
- Build multi-tenant systems with strong isolation, security, and resource governance
- Define and evolve platform abstractions across cloud environments (AWS, GCP, Azure, on-prem)
- Drive architectural reviews and contribute high-quality design proposals (RFCs)
Control Plane Engineering
- Design and build systems for tenant provisioning, lifecycle management, and configuration
- Develop cluster orchestration and management workflows at scale
- Build APIs and automation to enable self-service for internal teams and customers
- Ensure high availability, observability, and auditability of platform services
Data Plane Engineering
- Build high-throughput, low-latency data path components
- Implement strong tenant isolation at the data layer
- Optimize systems for performance, reliability, and cost efficiency
Engineering Excellence
- Uphold high standards in system design, code quality, and operational excellence
- Identify and mitigate risks across reliability, scalability, and security
- Partner with SRE/DevOps on observability, incident response, and capacity planning
Mentorship & Collaboration
- Mentor engineers through design reviews, code reviews, and hands-on guidance
- Contribute to hiring and help maintain a high technical bar
- Collaborate across teams and geographies to drive platform direction
What You’ll Have
SaaS Platform & Multi-Tenancy
- Strong experience building or operating multi-tenant SaaS platforms (silo/pool/bridge models)
- Expertise in tenant lifecycle management and resource governance
- Knowledge of data isolation strategies (schema-per-tenant, DB-per-tenant, row-level security, encryption)
Control Plane & Data Plane Systems
- Experience with control plane/data plane separation and cluster management
- Hands-on experience with Kubernetes (operators, CRDs, RBAC, namespaces)
- Familiarity with configuration management at scale (GitOps, feature flags, dynamic config)
Cloud & Infrastructure
- Hands-on experience with AWS, GCP, or Azure
- Strong understanding of VPC, IAM, Kubernetes (EKS/GKE/AKS), and IaC (Terraform/Pulumi)
- Exposure to service mesh technologies (Istio, Linkerd, Envoy)
Distributed Systems & Security
- Deep understanding of distributed systems (HA, fault tolerance, scalability)
- Experience with observability tools (Prometheus, Grafana, OpenTelemetry)
- Knowledge of security best practices (zero trust, secrets management, compliance standards)
AI-Augmented Engineering
- Comfortable using AI tools (Copilot, Cursor, Claude) to improve productivity
- Ability to evaluate and apply AI-generated outputs effectively
Good to Have
- Exposure to FinOps (cost attribution, showback/chargeback, cost optimization)
- Experience building observability platforms (logging, metrics, tracing, alerting)
- Contributions to open-source infrastructure or platform projects
What Success Looks Like
In 3 months:
- Understand platform architecture and key systems
- Deliver at least one meaningful improvement
- Build strong working relationships
In 6 months:
- Own and deliver a significant platform feature end-to-end
- Identify and address scalability/reliability gaps
- Contribute to hiring and team growth
In 12 months:
- Be a recognized technical leader within the platform team
- Drive measurable improvements in reliability, scale, or developer experience