/v1/chat/completions 통해서 문맥을 유지할때 어떻게 구현되나 했더니

llama-swap 에서 대화내용을 보니 이해된다.

assistant에 ai 대답을 넣는다고만 해서 복수개면 어떻게 하나 했는데

 

UI 상으로는 이렇게 나오고

 

로그 상으로는 아래와 같이 나온다

1번 째 질문 "하이하이"

 

2번 쩨 질문 "엉 왜 refused"

그리고 이전 대화를 messages의 배열에 순서대로 넣으면

가장 마지막 대화를 기준으로 답을 주게 되는걸려나?

당연(?) 하지만 reasoning은 빼고 순수 응답 내용만 assistant에 넣어서 보낸다.

Posted by 구차니

그런데 208 이던 228 이던

client.chat.completions.create 함수를

client.responses.create 로 바꾸었더니 prompt speed / gen speed가 출력되지 않는다.

reasoning off 하기 위해서는 함수를 바꾸어야 하고. 바꾸면 리포트가 안되고 흐음..

걍 서버에서 끄고 해야하나? (llama-cli --reasoning off)

 

'프로그램 사용 > ai 프로그램' 카테고리의 다른 글

llama-swap 구현 (채팅)  (0) 2026.06.18
llama-swap 버전 업데이트!  (0) 2026.06.18
stable diffusion --device-id  (0) 2026.06.18
stable diffusion illustruousXL LoRA  (0) 2026.06.15
statble diffusion XL 해상도  (0) 2026.06.15
Posted by 구차니

208 에서 228로 올렸더니

 

1. config.yaml 의 명시적 사용

기존에는 config.yaml을 바로 가져가더니(llama-swap 과 동일 경로에서) 이제는 명시적으로 지정해주어야 한다.1

$ ./llama-swap 
2026/06/18 12:43:40 ERROR -config is required

$ ./llama-swap --help
Usage of ./llama-swap:
  -config string
     path to config file (required)
  -listen string
     listen address (default :8080 or :8443 for TLS)
  -tls-cert-file string
     TLS certificate file
  -tls-key-file string
     TLS key file
  -version
     show version and exit
  -watch-config
     reload config on file change

 

2. 모니터링 추가

performance 탭에서 그래프가 생긴것 같다. 오오 이쁜데?



Posted by 구차니

버전에 따라 안 먹는게 있는지 라고 하기에는

저번에 잘 한거 같기도 한데 잘 모르겠네

 

아무튼 아래처럼 바꾸어 주면 자로딘다.

./webui.sh --server-name=0.0.0.0 --device-id=1 --api --medvram
CUDA_VISIBLE_DEVICES=1 ./webui.sh --server-name=0.0.0.0 --api --medvram

 

일단 현재 버전(?)은 아래와 같은 상태.

저번에 확장 깔다가 꼬여서 1.7 버전이라고 받아둔걸로 했더니 안되는건가..

$ git history
commit 1937682a20f7f0442311a1ede68f9f0cb480163b (HEAD -> dev, origin/dev)
Merge: 76759a18 fd0f475a
Author: w-e-w <40751091+w-e-w@users.noreply.github.com>
Date:   Mon Mar 2 16:00:53 2026 +0900

    Merge pull request #17313 from WhizZest/fix-setuptools-version
    
    Fix the issue of `pip install 'setuptools<70'` failing in cmd

$ git branch
* dev
  master

$ git remote -v
origin  https://github.com/AUTOMATIC1111/stable-diffusion-webui.git (fetch)
origin  https://github.com/AUTOMATIC1111/stable-diffusion-webui.git (push)
Posted by 구차니

개발환경 구축하려고 이것저것 보고 있는데

구형 모델을 쓰려고 하다보니(ssd mobilenet v3) 영 환경 잡는것 자체가 쉽지 않아 보인다.

 

[링크 : https://www.tensorflow.org/install/docker?hl=ko] 텐서플로우 docker

[링크 : https://github.com/NVIDIA/nvidia-docker] 요건 deprecated

[링크 : https://github.com/NVIDIA/nvidia-container-toolkit] 요걸 쓰라고

 

tensorflow 2.21.0

2026.03.07 release

지원 파이썬 버전 3.13 최대.

[링크 : https://pypi.org/project/tensorflow/]

 

tensorflow-gpu 2.12.0

2023.01.25 release

[링크 : https://pypi.org/project/tensorflow-gpu/]

 

그 와중에 우분투 26.04 의 파이썬 버전이 3.14 대이고

$ python3 --version
Python 3.14.4

 

텐서플로우는 지원버전이 없다고 배째고

$ pip3 install tensorflow
ERROR: Could not find a version that satisfies the requirement tensorflow (from versions: none)
ERROR: No matching distribution found for tensorflow

 

난 어짜피 gpu 가속받을거니까~ 하고 실행하는데 에러

$ pip3 install tensorflow-gpu
Collecting tensorflow-gpu
  Using cached tensorflow-gpu-2.12.0.tar.gz (2.6 kB)
  Installing build dependencies ... done
  Getting requirements to build wheel ... error
  error: subprocess-exited-with-error
  
  × Getting requirements to build wheel did not run successfully.
  │ exit code: 1
  ╰─> [77 lines of output]
      Traceback (most recent call last):
        File "/tmp/pip-build-env-kb1b92jj/overlay/lib/python3.14/site-packages/setuptools/_vendor/packaging/requirements.py", line 36, in __init__
          parsed = _parse_requirement(requirement_string)
        File "/tmp/pip-build-env-kb1b92jj/overlay/lib/python3.14/site-packages/setuptools/_vendor/packaging/_parser.py", line 71, in parse_requirement
          return _parse_requirement(Tokenizer(source, rules=DEFAULT_RULES))
        File "/tmp/pip-build-env-kb1b92jj/overlay/lib/python3.14/site-packages/setuptools/_vendor/packaging/_parser.py", line 89, in _parse_requirement
          url, specifier, marker = _parse_requirement_details(tokenizer)
                                   ~~~~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^
        File "/tmp/pip-build-env-kb1b92jj/overlay/lib/python3.14/site-packages/setuptools/_vendor/packaging/_parser.py", line 135, in _parse_requirement_details
          marker = _parse_requirement_marker(
              tokenizer,
          ...<5 lines>...
              ),
          )
        File "/tmp/pip-build-env-kb1b92jj/overlay/lib/python3.14/site-packages/setuptools/_vendor/packaging/_parser.py", line 156, in _parse_requirement_marker
          tokenizer.raise_syntax_error(
          ~~~~~~~~~~~~~~~~~~~~~~~~~~~~^
              f"Expected {expected} or end",
              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
              span_start=span_start,
              ^^^^^^^^^^^^^^^^^^^^^^
              span_end=None,
              ^^^^^^^^^^^^^^
          )
          ^
        File "/tmp/pip-build-env-kb1b92jj/overlay/lib/python3.14/site-packages/setuptools/_vendor/packaging/_tokenizer.py", line 166, in raise_syntax_error
          raise ParserSyntaxError(
          ...<3 lines>...
          )
      packaging._tokenizer.ParserSyntaxError: Expected semicolon (after name with no version specifier) or end
          python_version>"3.7"
                        ^
      
      The above exception was the direct cause of the following exception:
      
      Traceback (most recent call last):
        File "/home/minimonk/venv/lib/python3.14/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 389, in <module>
          main()
          ~~~~^^
        File "/home/minimonk/venv/lib/python3.14/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 373, in main
          json_out["return_val"] = hook(**hook_input["kwargs"])
                                   ~~~~^^^^^^^^^^^^^^^^^^^^^^^^
        File "/home/minimonk/venv/lib/python3.14/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 143, in get_requires_for_build_wheel
          return hook(config_settings)
        File "/tmp/pip-build-env-kb1b92jj/overlay/lib/python3.14/site-packages/setuptools/build_meta.py", line 333, in get_requires_for_build_wheel
          return self._get_build_requires(config_settings, requirements=[])
                 ~~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/tmp/pip-build-env-kb1b92jj/overlay/lib/python3.14/site-packages/setuptools/build_meta.py", line 301, in _get_build_requires
          self.run_setup()
          ~~~~~~~~~~~~~~^^
        File "/tmp/pip-build-env-kb1b92jj/overlay/lib/python3.14/site-packages/setuptools/build_meta.py", line 520, in run_setup
          super().run_setup(setup_script=setup_script)
          ~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/tmp/pip-build-env-kb1b92jj/overlay/lib/python3.14/site-packages/setuptools/build_meta.py", line 317, in run_setup
          exec(code, locals())
          ~~~~^^^^^^^^^^^^^^^^
        File "<string>", line 40, in <module>
        File "/tmp/pip-build-env-kb1b92jj/overlay/lib/python3.14/site-packages/setuptools/__init__.py", line 114, in setup
          _install_setup_requires(attrs)
          ~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^
        File "/tmp/pip-build-env-kb1b92jj/overlay/lib/python3.14/site-packages/setuptools/__init__.py", line 85, in _install_setup_requires
          dist.parse_config_files(ignore_option_errors=True)
          ~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/tmp/pip-build-env-kb1b92jj/overlay/lib/python3.14/site-packages/setuptools/dist.py", line 764, in parse_config_files
          self._finalize_requires()
          ~~~~~~~~~~~~~~~~~~~~~~~^^
        File "/tmp/pip-build-env-kb1b92jj/overlay/lib/python3.14/site-packages/setuptools/dist.py", line 382, in _finalize_requires
          self._normalize_requires()
          ~~~~~~~~~~~~~~~~~~~~~~~~^^
        File "/tmp/pip-build-env-kb1b92jj/overlay/lib/python3.14/site-packages/setuptools/dist.py", line 400, in _normalize_requires
          self.install_requires = list_(map(str, _reqs.parse(install_requires)))
                                  ~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/tmp/pip-build-env-kb1b92jj/overlay/lib/python3.14/site-packages/setuptools/_vendor/packaging/requirements.py", line 38, in __init__
          raise InvalidRequirement(str(e)) from e
      packaging.requirements.InvalidRequirement: Expected semicolon (after name with no version specifier) or end
          python_version>"3.7"
                        ^
      [end of output]
  
  note: This error originates from a subprocess, and is likely not a problem with pip.
error: subprocess-exited-with-error

× Getting requirements to build wheel did not run successfully.
│ exit code: 1
╰─> See above for output.

note: This error originates from a subprocess, and is likely not a problem with pip.

 

tensorflow 도 이제 인기가 없어졌다고 해야하나.. 지는 해가 되어버렸나 보다.

[링크 : https://github.com/tensorflow/tensorflow/issues/102890]

 

짧고 명확하다... deprecation. 후..

TensorFlow Object Detection API
TensorFlow 2.2 TensorFlow 1.15 Python 3.6

Deprecation

[링크 : https://github.com/tensorflow/models/blob/master/research/object_detection/README.md]

 

keras도 희망이 없고, pytorch도 좋은소리 안나오고 어떻게 해야하나..

by GPT

 

by claude

'프로그램 사용 > yolo_tensorflow' 카테고리의 다른 글

STFPM 실행  (0) 2026.06.01
딥러닝 학습 관련(epoch, loss)  (0) 2026.05.27
NAS - Neural Architecture Search  (0) 2026.05.21
MCUNet  (0) 2026.05.21
STFPM - Student-Teacher Feature Pyramid Matching  (0) 2026.05.20
Posted by 구차니

SDXL 이렇게 되어있어서 해봤는데 안되길래

혹시나 해서 illustriousXL 로 찾으니 나왔고 받아보니 된다!

 

정보를 보니 sdxl_base_v1-0

 

모델 정보에서 SDXL로 나온다.

 

[링크 : https://civitai.com/models/1003582/lora-illustriousxl-barbie-mugshot-meme]

 

요건 일러스트리어스에서는 안되는 놈인데

 

읭?

 

이것도 SD1 용이라 일러스트리어스에 안나온듯.

말로만 sdxl 이지 실제로는 Stable diffusion 계열인듯.

[링크 : https://civitai.com/models/242475/sdxl-lora-photorealistic-female]

 

요건 SD용 구름에 고래 잘 그려주는 lora.

메타데이터가 많은데 얘도 SD1용

 

일관성있는 어떤 데이터가 아닌가?

 

civitai 에서 sdxl로 치면 여러가지 나오는데

sdxl 1.0을 하면 되려나?

 

SDXL 1.0 으로 해서 받으니 일러스트리어스에서 뜨긴한다.

[링크 : https://civitai.com/models/553959/sdxl-lora-v3-pixelized-art-intermediate-edition]

'프로그램 사용 > ai 프로그램' 카테고리의 다른 글

llama-swap 버전 업데이트!  (0) 2026.06.18
stable diffusion --device-id  (0) 2026.06.18
statble diffusion XL 해상도  (0) 2026.06.15
stable diffusion, control net  (0) 2026.06.14
stable diffusion, lora  (0) 2026.06.14
Posted by 구차니

1024x1024로만 생성했었는데

생성을 조금이라도 빨리하려면 해상도가 낮으면 되려나?

가로 세로 픽셀수
1024 1024 1048576
1152 896 1032192
896 1152 1032192
1216 832 1011712
832 1216 1011712
1344 768 1032192
768 1344 1032192
1536 640 983040
640 1536 983040

 

그나저나 SD는 512x512 기본인데, 해상도 정보가 잘 안보인다.

 

[링크 : https://www.clien.net/service/board/cm_aigurim/18247681]

[링크 : https://platform.stability.ai/docs/legacy/grpc-api/features/api-parameters]

    [링크 : https://www.reddit.com/r/StableDiffusion/comments/15c3rf6/sdxl_resolution_cheat_sheet/?tl=ko]

'프로그램 사용 > ai 프로그램' 카테고리의 다른 글

stable diffusion --device-id  (0) 2026.06.18
stable diffusion illustruousXL LoRA  (0) 2026.06.15
stable diffusion, control net  (0) 2026.06.14
stable diffusion, lora  (0) 2026.06.14
gemma4-e4b 모델 변환 및 레이어 계산  (0) 2026.06.12
Posted by 구차니

저번에 플러그인 깔다가 박살나서 해보긴 좀 무서움.. 나중에 해봐야지

[링크 : https://www.internetmap.kr/entry/Stable-Diffusion-ControlNet1]

[링크 : https://huggingface.co/lllyasviel/ControlNet-v1-1/tree/main]

 

일단 이건 자세를 제한할 수 있는 녀석. 어떻게 보면.. 

그림 그려서 원하는 대로 수정하는 그런 느낌인데.. 맞나?

[링크 : https://www.internetmap.kr/entry/AUTOMATIC1111-Inpaint-Guide]

Posted by 구차니

lora도 model/lora에 넣고 리프레시 하고 나서

Lora 에서 클릭하면 긍정 프롬프트에 메시지가 추가된다.

 

테스트 해보니 0~1 이라는데 2가 완전 망가지는걸 보면, 소수점 단위로 테스트 해보면 다르게 나오려나?

미적용  <lora:outfit-humpback-pynoise:0.2>

 

 <lora:outfit-humpback-pynoise:0.5>  <lora:outfit-humpback-pynoise:1>

 

 

<lora:outfit-humpback-pynoise:2> <lora:outfit-humpback-pynoise:4>

 

모델이 같진 않으니 동일하게 나올순 없을 듯.

humpback, cloud,(building), sky, moon, star (sky), scenery, no humans, starry sky, night, fish, night sky, full moon, cloudy sky, outdoors, fantasy<lora:humpback-pynoise:0.8>
Negative prompt: nipples,extra fingers,fewer fingers,(low quality, worst quality:1.65), (bad anatomy), (inaccurate limb:1.2),bad composition, inaccurate eyes, extra digit,fewer digits,(extra arms:1.2),nipples,origin EasyNegative

Steps: 26,
CFG scale: 7,
Sampler: DPM++ 2M Karras,
Seed: 3421073841,
ENSD: 31337,
Size: 448x832, Model: Counterfeit-V3.0_fp16, Model hash: cbfba64e66, Hires steps: 18, Hires upscale: 1.9, Hires upscaler: 4x-AnimeSharp, Denoising strength: 0.45, Clip skip: 2

[링크 : https://civitai.com/models/101550/lora-cloudwhale-concept]

 

지브리 스타일.. 이런식으로 전체적인 화풍이 Lora를 통해서 바꿀수 있나보다.

[링크 : https://www.internetmap.kr/entry/How-to-LoRA-Model]

 

 

+

2026.06.15

프롬프트에 넣어도 적용이 안되길래 테스트 해보는데

 

SD 모델에서는 lora가 뜨는데

 

SDXL 에서는 lora가 안뜬다.

 

Posted by 구차니

심심해서(?) safetensor를 gguf로 변환하면서 출력되는 레이어 정보를 가지고 장난을 쳐봄

계산을 보면 75억 개. bilion이 10억이라서 대충 반올림 하면 8b 모델이 된다.

그래서 모델 사이즈도 8B로 나오나 보다.

[링크 : https://huggingface.co/google/gemma-4-E4B-it]

 

그럼에도 8B가 아니라 e4b인 이유는, 임베딩이 8B 이고 실제 유효한건 4.5라서 그렇게 표현한거 같은데

그럼.. 쓸데없이 크다는 건가?

 

오호. E2B는 정말로 5B네

[링크 : https://huggingface.co/google/gemma-4-E2B]

 

        7,518,069,290 sum
           
INFO:hf-to-gguf:rope_freqs.weight, torch.float32 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:token_embd.weight, torch.bfloat16 --> F16, shape = {2560, 262144} 2560 262144   671088640  
INFO:hf-to-gguf:per_layer_token_embd.weight, torch.bfloat16 --> F16, shape = {10752, 262144} 10752 262144   2818572288  
INFO:hf-to-gguf:blk.0.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.0.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.0.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.0.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.0.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.0.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.0.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.0.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.0.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.0.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.0.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.0.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.0.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.0.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.0.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.0.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.0.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.1.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.1.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.1.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.1.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.1.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.1.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.1.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.1.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.1.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.1.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.1.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.1.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.1.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.1.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.1.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.1.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.1.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.10.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.10.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.10.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.10.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.10.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.10.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.10.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.10.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.10.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.10.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.10.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.10.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.10.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.10.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.10.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.10.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.10.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.11.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.11.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.11.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.11.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.11.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.11.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.11.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.11.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.11.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.11.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.11.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.11.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {512} 512     512  
INFO:hf-to-gguf:blk.11.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 1024} 2560 1024   2621440  
INFO:hf-to-gguf:blk.11.attn_output.weight, torch.bfloat16 --> F16, shape = {4096, 2560} 4096 2560   10485760  
INFO:hf-to-gguf:blk.11.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {512} 512     512  
INFO:hf-to-gguf:blk.11.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 4096} 2560 4096   10485760  
INFO:hf-to-gguf:blk.11.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 1024} 2560 1024   2621440  
INFO:hf-to-gguf:blk.12.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.12.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.12.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.12.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.12.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.12.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.12.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.12.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.12.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.12.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.12.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.12.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.12.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.12.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.12.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.12.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.12.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.13.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.13.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.13.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.13.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.13.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.13.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.13.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.13.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.13.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.13.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.13.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.13.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.13.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.13.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.13.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.13.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.13.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.14.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.14.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.14.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.14.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.14.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.14.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.14.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.14.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.14.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.14.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.14.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.14.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.14.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.14.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.14.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.14.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.14.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.15.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.15.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.15.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.15.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.15.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.15.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.15.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.15.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.15.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.15.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.15.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.15.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.15.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.15.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.15.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.15.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.15.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.16.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.16.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.16.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.16.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.16.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.16.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.16.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.16.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.16.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.16.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.16.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.16.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.16.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.16.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.16.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.16.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.16.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.17.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.17.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.17.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.17.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.17.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.17.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.17.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.17.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.17.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.17.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.17.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.17.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {512} 512     512  
INFO:hf-to-gguf:blk.17.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 1024} 2560 1024   2621440  
INFO:hf-to-gguf:blk.17.attn_output.weight, torch.bfloat16 --> F16, shape = {4096, 2560} 4096 2560   10485760  
INFO:hf-to-gguf:blk.17.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {512} 512     512  
INFO:hf-to-gguf:blk.17.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 4096} 2560 4096   10485760  
INFO:hf-to-gguf:blk.17.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 1024} 2560 1024   2621440  
INFO:hf-to-gguf:blk.18.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.18.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.18.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.18.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.18.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.18.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.18.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.18.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.18.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.18.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.18.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.18.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.18.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.18.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.18.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.18.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.18.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.19.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.19.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.19.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.19.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.19.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.19.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.19.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.19.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.19.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.19.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.19.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.19.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.19.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.19.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.19.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.19.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.19.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.2.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.2.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.2.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.2.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.2.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.2.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.2.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.2.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.2.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.2.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.2.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.2.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.2.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.2.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.2.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.2.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.2.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.20.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.20.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.20.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.20.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.20.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.20.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.20.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.20.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.20.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.20.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.20.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.20.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.20.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.20.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.20.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.20.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.20.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.21.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.21.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.21.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.21.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.21.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.21.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.21.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.21.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.21.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.21.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.21.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.21.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.21.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.21.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.21.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.21.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.21.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.22.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.22.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.22.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.22.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.22.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.22.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.22.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.22.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.22.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.22.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.22.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.22.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.22.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.22.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.22.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.22.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.22.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.23.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.23.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.23.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.23.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.23.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.23.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.23.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.23.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.23.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.23.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.23.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.23.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {512} 512     512  
INFO:hf-to-gguf:blk.23.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 1024} 2560 1024   2621440  
INFO:hf-to-gguf:blk.23.attn_output.weight, torch.bfloat16 --> F16, shape = {4096, 2560} 4096 2560   10485760  
INFO:hf-to-gguf:blk.23.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {512} 512     512  
INFO:hf-to-gguf:blk.23.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 4096} 2560 4096   10485760  
INFO:hf-to-gguf:blk.23.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 1024} 2560 1024   2621440  
INFO:hf-to-gguf:blk.24.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.24.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.24.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.24.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.24.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.24.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.24.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.24.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.24.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.24.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.24.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.24.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.24.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.24.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.24.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.24.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.24.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.25.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.25.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.25.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.25.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.25.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.25.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.25.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.25.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.25.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.25.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.25.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.25.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.25.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.25.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.25.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.25.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.25.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.26.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.26.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.26.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.26.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.26.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.26.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.26.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.26.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.26.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.26.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.26.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.26.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.26.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.26.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.26.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.26.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.26.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.27.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.27.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.27.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.27.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.27.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.27.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.27.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.27.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.27.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.27.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.27.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.27.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.27.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.27.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.27.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.27.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.27.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.28.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.28.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.28.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.28.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.28.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.28.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.28.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.28.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.28.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.28.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.28.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.28.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.28.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.28.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.28.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.28.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.28.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.29.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.29.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.29.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.29.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.29.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.29.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.29.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.29.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.29.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.29.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.29.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.29.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {512} 512     512  
INFO:hf-to-gguf:blk.29.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 1024} 2560 1024   2621440  
INFO:hf-to-gguf:blk.29.attn_output.weight, torch.bfloat16 --> F16, shape = {4096, 2560} 4096 2560   10485760  
INFO:hf-to-gguf:blk.29.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {512} 512     512  
INFO:hf-to-gguf:blk.29.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 4096} 2560 4096   10485760  
INFO:hf-to-gguf:blk.29.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 1024} 2560 1024   2621440  
INFO:hf-to-gguf:blk.3.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.3.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.3.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.3.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.3.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.3.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.3.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.3.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.3.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.3.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.3.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.3.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.3.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.3.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.3.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.3.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.3.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.30.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.30.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.30.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.30.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.30.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.30.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.30.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.30.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.30.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.30.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.30.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.30.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.30.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.30.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.30.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.30.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.30.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.31.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.31.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.31.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.31.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.31.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.31.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.31.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.31.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.31.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.31.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.31.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.31.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.31.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.31.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.31.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.31.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.31.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.32.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.32.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.32.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.32.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.32.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.32.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.32.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.32.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.32.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.32.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.32.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.32.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.32.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.32.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.32.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.32.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.32.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.33.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.33.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.33.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.33.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.33.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.33.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.33.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.33.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.33.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.33.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.33.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.33.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.33.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.33.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.33.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.33.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.33.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.34.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.34.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.34.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.34.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.34.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.34.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.34.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.34.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.34.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.34.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.34.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.34.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.34.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.34.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.34.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.34.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.34.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.35.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.35.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.35.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.35.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.35.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.35.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.35.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.35.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.35.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.35.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.35.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.35.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {512} 512     512  
INFO:hf-to-gguf:blk.35.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 1024} 2560 1024   2621440  
INFO:hf-to-gguf:blk.35.attn_output.weight, torch.bfloat16 --> F16, shape = {4096, 2560} 4096 2560   10485760  
INFO:hf-to-gguf:blk.35.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {512} 512     512  
INFO:hf-to-gguf:blk.35.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 4096} 2560 4096   10485760  
INFO:hf-to-gguf:blk.35.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 1024} 2560 1024   2621440  
INFO:hf-to-gguf:blk.36.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.36.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.36.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.36.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.36.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.36.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.36.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.36.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.36.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.36.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.36.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.36.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.36.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.36.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.36.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.36.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.36.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.37.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.37.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.37.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.37.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.37.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.37.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.37.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.37.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.37.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.37.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.37.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.37.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.37.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.37.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.37.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.37.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.37.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.38.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.38.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.38.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.38.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.38.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.38.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.38.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.38.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.38.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.38.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.38.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.38.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.38.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.38.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.38.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.38.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.38.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.39.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.39.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.39.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.39.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.39.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.39.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.39.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.39.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.39.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.39.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.39.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.39.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.39.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.39.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.39.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.39.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.39.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.4.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.4.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.4.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.4.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.4.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.4.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.4.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.4.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.4.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.4.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.4.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.4.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.4.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.4.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.4.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.4.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.4.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.40.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.40.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.40.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.40.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.40.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.40.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.40.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.40.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.40.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.40.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.40.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.40.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.40.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.40.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.40.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.40.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.40.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.41.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.41.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.41.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.41.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.41.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.41.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.41.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.41.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.41.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.41.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.41.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.41.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {512} 512     512  
INFO:hf-to-gguf:blk.41.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 1024} 2560 1024   2621440  
INFO:hf-to-gguf:blk.41.attn_output.weight, torch.bfloat16 --> F16, shape = {4096, 2560} 4096 2560   10485760  
INFO:hf-to-gguf:blk.41.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {512} 512     512  
INFO:hf-to-gguf:blk.41.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 4096} 2560 4096   10485760  
INFO:hf-to-gguf:blk.41.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 1024} 2560 1024   2621440  
INFO:hf-to-gguf:blk.5.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.5.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.5.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.5.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.5.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.5.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.5.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.5.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.5.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.5.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.5.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.5.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {512} 512     512  
INFO:hf-to-gguf:blk.5.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 1024} 2560 1024   2621440  
INFO:hf-to-gguf:blk.5.attn_output.weight, torch.bfloat16 --> F16, shape = {4096, 2560} 4096 2560   10485760  
INFO:hf-to-gguf:blk.5.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {512} 512     512  
INFO:hf-to-gguf:blk.5.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 4096} 2560 4096   10485760  
INFO:hf-to-gguf:blk.5.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 1024} 2560 1024   2621440  
INFO:hf-to-gguf:blk.6.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.6.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.6.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.6.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.6.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.6.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.6.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.6.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.6.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.6.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.6.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.6.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.6.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.6.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.6.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.6.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.6.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.7.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.7.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.7.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.7.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.7.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.7.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.7.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.7.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.7.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.7.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.7.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.7.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.7.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.7.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.7.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.7.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.7.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.8.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.8.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.8.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.8.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.8.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.8.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.8.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.8.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.8.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.8.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.8.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.8.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.8.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.8.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.8.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.8.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.8.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.9.attn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.9.layer_output_scale.weight, torch.bfloat16 --> F32, shape = {1} 1     1  
INFO:hf-to-gguf:blk.9.ffn_down.weight, torch.bfloat16 --> F16, shape = {10240, 2560} 10240 2560   26214400  
INFO:hf-to-gguf:blk.9.ffn_gate.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.9.ffn_up.weight, torch.bfloat16 --> F16, shape = {2560, 10240} 2560 10240   26214400  
INFO:hf-to-gguf:blk.9.inp_gate.weight, torch.bfloat16 --> F16, shape = {2560, 256} 2560 256   655360  
INFO:hf-to-gguf:blk.9.proj.weight, torch.bfloat16 --> F16, shape = {256, 2560} 256 2560   655360  
INFO:hf-to-gguf:blk.9.post_attention_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.9.post_ffw_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.9.post_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.9.ffn_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:blk.9.attn_k_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.9.attn_k.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:blk.9.attn_output.weight, torch.bfloat16 --> F16, shape = {2048, 2560} 2048 2560   5242880  
INFO:hf-to-gguf:blk.9.attn_q_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  
INFO:hf-to-gguf:blk.9.attn_q.weight, torch.bfloat16 --> F16, shape = {2560, 2048} 2560 2048   5242880  
INFO:hf-to-gguf:blk.9.attn_v.weight, torch.bfloat16 --> F16, shape = {2560, 512} 2560 512   1310720  
INFO:hf-to-gguf:output_norm.weight, torch.bfloat16 --> F32, shape = {2560} 2560     2560  
INFO:hf-to-gguf:per_layer_model_proj.weight, torch.bfloat16 --> F16, shape = {2560, 10752} 2560 10752   27525120  
INFO:hf-to-gguf:per_layer_proj_norm.weight, torch.bfloat16 --> F32, shape = {256} 256     256  

'프로그램 사용 > ai 프로그램' 카테고리의 다른 글

stable diffusion, control net  (0) 2026.06.14
stable diffusion, lora  (0) 2026.06.14
safetensors to gguf .. 2번째 시도 성공  (0) 2026.06.12
openai reasoning 옵션  (0) 2026.06.12
LLM temperature = 0.1 무한 반복  (0) 2026.06.12
Posted by 구차니