Location: San Francisco (Onsite)
Type: Full-time
Start Date: ASAP
What you'll do:
- Design and implement network architectures for GPU clusters and high-performance computing environments
- Build real-time network topology discovery and awareness systems
- Debug and optimize data center fabric performance across switches, NICs, and interconnects
- Develop automation for network configuration, validation, and deployment at scale
- Integrate network telemetry into our hardware intelligence platform
- Optimize the network stack for low-latency, high-throughput workloads
- Collaborate with customers deploying AI infrastructure in enterprise and hyperscale environments
Strong experience with:
- Data center network architectures (spine-leaf, Clos, fat-tree)
- Network protocols (BGP, OSPF, EVPN-VXLAN, RDMA/RoCE)
- High-performance networking (InfiniBand, 100G/400G Ethernet, SmartNICs)
- Network operating systems (Cumulus, SONiC, Arista EOS, Cisco NX-OS)
- Network automation and programmability (Python, Ansible, NETCONF/YANG)
- Packet capture and analysis (Wireshark, tcpdump, eBPF)
Familiarity with:
- GPU cluster networking and AI/ML workload requirements
- Storage networking (NVMe-oF, iSCSI, FC)
- Network telemetry and observability (streaming telemetry, sFlow, NetFlow)
- Linux networking stack and kernel networking
- SDN controllers and network virtualization
We're looking for a network engineer who understands data center fabric at the packet level and can build intelligent systems that make network configuration and debugging effortless.
You should have:
- Deep expertise in modern data center network design and operations
- Experience with high-performance computing or AI infrastructure networking
- Ability to debug complex network issues across multiple layers
- Comfort working at the intersection of hardware, software, and networking
- Excitement about automating away the pain of network operations
Requirements:
- Bachelor's or Master's in Computer Science, Computer Engineering, Electrical Engineering, or equivalent experience
- 4+ years of data center network engineering experience
- Willingness to work startup hours in person, weekends included, at our San Francisco office
- Work authorization in the United States
We're building the intelligence layer for hardware — real-time systems that control physical machines with zero tolerance for latency or failure.
What we offer:
- Startup-level equity and highly competitive salary
- Ownership over network intelligence that powers GPU clusters and AI infrastructure
- Problems at the cutting edge of data center networking and automation
- Close collaboration with customers building next-generation AI infrastructure
How to apply:
Email: team@cosmiclabs.io
Subject line: Data Center Network / [Your Name]
Include in your email:
- Your name
- Why this role and why Cosmic Labs
- What you bring technically
- Soonest available start date
- GitHub or GitLab link
- Confirmation of work authorization in the U.S.
- Confirmation of willingness to work full-time and in person in San Francisco
Attach: PDF resume