This is a GGUF conversion of the Llama 3.1 8B Instruct model (https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct), produced with the GGUF-my-repo tool (https://huggingface.co/spaces/ggml-org/gguf-my-repo).
The model was quantized with the Q4_K_M method, using an importance matrix (imatrix).
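
For readers who want to reproduce a similar conversion locally, the sketch below builds the three llama.cpp commands that roughly correspond to this kind of pipeline: convert the Hugging Face checkpoint to an f16 GGUF, compute an importance matrix, then quantize to Q4_K_M. It is an approximation, not the exact procedure GGUF-my-repo runs. The directory name, the calibration file `calibration.txt`, and the output file names are hypothetical placeholders, and the script only prints the commands without executing them.

```python
from pathlib import Path


def build_commands(model_dir, calibration_file, out_dir="out", quant_type="Q4_K_M"):
    """Return llama.cpp command lines for an imatrix-based quantization.

    All paths and file names are illustrative assumptions; adjust them
    to your own llama.cpp checkout and model location.
    """
    out = Path(out_dir)
    f16 = out / "model-f16.gguf"            # intermediate full-precision GGUF
    imatrix = out / "imatrix.dat"           # importance matrix file
    quantized = out / f"model-{quant_type}.gguf"

    return [
        # 1. Convert the Hugging Face checkpoint to an f16 GGUF file.
        ["python", "convert_hf_to_gguf.py", str(model_dir),
         "--outfile", str(f16), "--outtype", "f16"],
        # 2. Compute the importance matrix from a calibration text file.
        ["llama-imatrix", "-m", str(f16),
         "-f", str(calibration_file), "-o", str(imatrix)],
        # 3. Quantize, guided by the importance matrix.
        ["llama-quantize", "--imatrix", str(imatrix),
         str(f16), str(quantized), quant_type],
    ]


if __name__ == "__main__":
    # Hypothetical local paths, for illustration only.
    for cmd in build_commands("Llama-3.1-8B-Instruct", "calibration.txt"):
        print(" ".join(cmd))
```

Returning the commands as argument lists makes them easy to inspect first and to pass to `subprocess.run` afterwards, once the paths match your setup.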