Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
polynomial
80 days ago
|
parent
|
context
|
favorite
| on:
1.5 TB of VRAM on Mac Studio – RDMA over Thunderbo...
BUILD AI has a post about this and in particular sharding k-v cache across GPUs, and how network is the new memory hierarchy:
https://buildai.substack.com/p/kv-cache-sharding-and-distrib...
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
https://buildai.substack.com/p/kv-cache-sharding-and-distrib...