A Serving System for Distributed and Parallel LLM Quantization [Efficient ML System]
-
Updated
Jun 18, 2025 - Python
A Serving System for Distributed and Parallel LLM Quantization [Efficient ML System]
Due progetti in C per calcolare numeri primi e coppie di numeri primi gemelli.
Notes and Labs from MIT 6.5940, (Fall 2023) lecture.
Add a description, image, and links to the efficient-computing topic page so that developers can more easily learn about it.
To associate your repository with the efficient-computing topic, visit your repo's landing page and select "manage topics."