Llama 3.1 8B Instruct Q4_K_M GGUF Model 3.1

Description

GGUF conversion of the Llama 3.1 8B Instruct model (https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct), produced with the GGUF-my-repo tool (https://huggingface.co/spaces/ggml-org/gguf-my-repo). The conversion uses an importance matrix and the Q4_K_M quantization method.
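
As a quick sanity check after downloading a GGUF file such as this one, the fixed-size header can be read with the Python standard library alone. The sketch below is not part of this package. It assumes a little-endian GGUF file of version 2 or later, where the header is a 4-byte magic "GGUF", a 32-bit version, a 64-bit tensor count, and a 64-bit metadata key/value count. The names read_gguf_header and GGUFHeader are hypothetical helpers introduced for illustration.

```python
import struct
from typing import NamedTuple

GGUF_MAGIC = b"GGUF"
# Assumed layout (little-endian, GGUF v2+): magic, uint32 version,
# uint64 tensor count, uint64 metadata key/value count = 24 bytes.
HEADER_FORMAT = "<4sIQQ"
HEADER_SIZE = struct.calcsize(HEADER_FORMAT)


class GGUFHeader(NamedTuple):
    version: int
    tensor_count: int
    metadata_kv_count: int


def read_gguf_header(path: str) -> GGUFHeader:
    """Read and validate the fixed-size header of a GGUF file."""
    with open(path, "rb") as f:
        raw = f.read(HEADER_SIZE)
    if len(raw) < HEADER_SIZE:
        raise ValueError("file is too short to contain a GGUF header")
    magic, version, tensor_count, kv_count = struct.unpack(HEADER_FORMAT, raw)
    if magic != GGUF_MAGIC:
        raise ValueError(f"unexpected magic bytes: {magic!r}")
    return GGUFHeader(version, tensor_count, kv_count)
```

Calling read_gguf_header on the downloaded file (the path is whatever name the file was saved under) returns the format version and the two counts. A ValueError suggests a truncated download or a file that is not GGUF. The quantization type itself is stored in the metadata and per-tensor records that follow the header, which this sketch does not parse.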

Dependencies
No dependencies
Used By
No repositories
Website

No developer web address
Current
3.1 updated 17 days ago