Come work at a place where innovation and teamwork come together to support the most exciting missions in the world!
You will lead the performance engineering efforts across Spark, Kafka, Elasticsearch, and Middleware APIs, ensuring that our real-time data pipelines and services meet enterprise-grade SLAs.
As part of our high-performing engineering team, you will design and execute performance testing strategies, identify system bottlenecks, and work with development teams to implement performance improvements that support billions of cyber security events processing a day across our data platform.
Own the performance strategy across distributed systems which includes Hadoop, Spark, Kafka, Elasticsearch/OpenSearch, Big Data Components and APIs for each release.
Define, develop, and execute performance test plans, load tests, stress tests, and soak tests.
Proactively identify bottlenecks, resource contention, and latency issues using tools such as JMeter, Spark UI, Kafka Manager, Elastic Monitoring and App Dynamics.
Provide deep-dive analysis and recommendations on tuning and scaling Spark jobs, Kafka topics/partitions, ES queries, and API endpoints.
Strong knowledge of profiling, debugging, and observability tools (e.g., Spark UI, Athena, Grafana, ELK).
Additional Plus Competencies: