view article Article Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval +1 aamirshakir, tomaarsen, SeanLee97 β’ Mar 22, 2024 β’ 134
view article Article Making LLMs lighter with AutoGPTQ and transformers +4 marcsun13, fxmarty, PanEa, qwopqwop, ybelkada, TheBloke β’ Aug 23, 2023 β’ 64
view article Article Introduction to Quantization cooked in π€ with ππ§βπ³ merve β’ Aug 25, 2023 β’ 39