work DiffMerge Rethinking Token Merging for Semantic Binding in Diffusion-based Image Editing KV Streaming Fast Long-Context LLM Serving via Streaming Layerwise-Compressed KV Cache fun