Sparse Attention Turns Long-Context Inference Into a Hardware Engineering Problem | Surf AI