Staff Engineer - Platform Engineering
ThoughtSpot
Job Description
About the Role
We are looking for a Staff Engineer with 9+ years of experience to join our Cloud Platform Engineering team and take deep ownership of critical platform systems within a multi-tenant SaaS environment.
This role sits at the intersection of backend engineering and cloud infrastructure, focused on building and operating scalable, reliable systems powering both the control plane and data plane.
As a senior technical leader, you will drive key architectural decisions, lead high-impact platform initiatives, and mentor engineers, while remaining hands-on in solving complex engineering problems at scale.
What You Will Do
Architecture & Platform Design
- Lead the architecture and design of core platform subsystems across control plane and data plane
- Build multi-tenant systems with strong isolation, security, and resource governance
- Define and evolve platform abstractions across cloud environments (AWS, GCP, Azure, on-prem)
- Drive architectural reviews and contribute high-quality design proposals (RFCs)
Control Plane Engineering
- Design and build systems for tenant provisioning, lifecycle management, and configuration
- Develop cluster orchestration and management workflows at scale
- Build APIs and automation to enable self-service for internal teams and customers
- Ensure high availability, observability, and auditability of platform services
Data Plane Engineering
- Build high-throughput, low-latency data path components
- Implement strong tenant isolation at the data layer
- Optimize systems for performance, reliability, and cost efficiency
Engineering Excellence
- Uphold high standards in system design, code quality, and operational excellence
- Identify and mitigate risks across reliability, scalability, and security
- Partner with SRE/DevOps on observability, incident response, and capacity planning
Mentorship & Collaboration
- Mentor engineers through design reviews, code reviews, and hands-on guidance
- Contribute to hiring and help maintain a high technical bar
- Collaborate across teams and geographies to drive platform direction
What You’ll Have
SaaS Platform & Multi-Tenancy
- Strong experience building or operating multi-tenant SaaS platforms (silo/pool/bridge models)
- Expertise in tenant lifecycle management and resource governance
- Knowledge of data isolation strategies (schema-per-tenant, DB-per-tenant, row-level security, encryption)
Control Plane & Data Plane Systems
- Experience with control plane/data plane separation and cluster management
- Hands-on experience with Kubernetes (operators, CRDs, RBAC, namespaces)
- Familiarity with configuration management at scale (GitOps, feature flags, dynamic config)
Cloud & Infrastructure
- Hands-on experience with AWS, GCP, or Azure
- Strong understanding of VPC, IAM, Kubernetes (EKS/GKE/AKS), and IaC (Terraform/Pulumi)
- Exposure to service mesh technologies (Istio, Linkerd, Envoy)
Distributed Systems & Security
- Deep understanding of distributed systems (HA, fault tolerance, scalability)
- Experience with observability tools (Prometheus, Grafana, OpenTelemetry)
- Knowledge of security best practices (zero trust, secrets management, compliance standards)
AI-Augmented Engineering
- Comfortable using AI tools (Copilot, Cursor, Claude) to improve productivity
- Ability to evaluate and apply AI-generated outputs effectively
Good to Have
- Exposure to FinOps (cost attribution, showback/chargeback, cost optimization)
- Experience building observability platforms (logging, metrics, tracing, alerting)
- Contributions to open-source infrastructure or platform projects
What Success Looks Like
In 3 months:
- Understand platform architecture and key systems
- Deliver at least one meaningful improvement
- Build strong working relationships
In 6 months:
- Own and deliver a significant platform feature end-to-end
- Identify and address scalability/reliability gaps
- Contribute to hiring and team growth
In 12 months:
- Be a recognized technical leader within the platform team
- Drive measurable improvements in reliability, scale, or developer experience
- Mentor engineers who demonstrate clear growth