LMCache
Open-source project (9.2K stars) that boosts inference speed for self-hosted large models and saves GPU memory; joined PyTorch Foundation, integrated by NVID...
Trust score100/100
About
Open-source project (9.2K stars) that boosts inference speed for self-hosted large models and saves GPU memory; joined PyTorch Foundation, integrated by NVIDIA Dynamo
Are you the author of this project?
Claim this pre-seeded listing to manage details, edit tags, or upload assets.
Please sign in using the button in the header to claim repository ownership.
Explore LMCache

