Data collection, aggregation, storage, retrieval, visualization
Python, js, java programming
Define a vision and roadmap for distributed data platform and observability systems for large-scale AI and HPC clusters and workloads
Job Summary
Define a vision and roadmap for distributed data platform and observability systems for large-scale AI and HPC clusters and workloads.
Architect systems for data collection, aggregation, enrichment, storage, retrieval, and visualization to spectacularly improve efficiency, performance, and productivity of AI and HPC workloads.
Lead technical teams to develop, deploy, and operate observability solutions for multiple compute clusters around the world.
Matching Summary
Define a vision and roadmap for distributed data platform and observability systems for large-scale AI and HPC clusters and workloads.