load_tensors: layer 0 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 1 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 2 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 3 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 4 assigned to device Vulkan0, is_swa = 0 load_tensors: layer 5 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 6 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 7 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 8 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 9 assigned to device Vulkan0, is_swa = 0 load_tensors: layer 10 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 11 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 12 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 13 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 14 assigned to device Vulkan0, is_swa = 0 load_tensors: layer 15 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 16 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 17 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 18 assigned to device Vulkan1, is_swa = 1 load_tensors: layer 19 assigned to device Vulkan1, is_swa = 0 load_tensors: layer 20 assigned to device Vulkan1, is_swa = 1 load_tensors: layer 21 assigned to device Vulkan1, is_swa = 1 load_tensors: layer 22 assigned to device Vulkan1, is_swa = 1 load_tensors: layer 23 assigned to device Vulkan1, is_swa = 1 load_tensors: layer 24 assigned to device Vulkan1, is_swa = 0 load_tensors: layer 25 assigned to device Vulkan1, is_swa = 1 load_tensors: layer 26 assigned to device Vulkan1, is_swa = 1 load_tensors: layer 27 assigned to device Vulkan1, is_swa = 1 load_tensors: layer 28 assigned to device Vulkan1, is_swa = 1 load_tensors: layer 29 assigned to device Vulkan1, is_swa = 0 load_tensors: layer 30 assigned to device Vulkan1, is_swa = 1 load_tensors: layer 31 assigned to device Vulkan1, is_swa = 1 load_tensors: layer 32 assigned to device Vulkan1, is_swa = 1 load_tensors: layer 33 assigned to device Vulkan1, is_swa = 1 load_tensors: layer 34 assigned to device Vulkan1, is_swa = 0 load_tensors: layer 35 assigned to device Vulkan1, is_swa = 0
load_tensors: offloading output layer to GPU load_tensors: offloading 34 repeating layers to GPU load_tensors: offloaded 36/36 layers to GPU load_tensors: Vulkan0 model buffer size = 0.00 MiB load_tensors: Vulkan1 model buffer size = 0.00 MiB load_tensors: Vulkan_Host model buffer size = 0.00 MiB llama_context: constructing llama_context |
load_tensors: layer 0 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 1 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 2 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 3 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 4 assigned to device Vulkan0, is_swa = 0 load_tensors: layer 5 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 6 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 7 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 8 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 9 assigned to device Vulkan0, is_swa = 0 load_tensors: layer 10 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 11 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 12 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 13 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 14 assigned to device Vulkan0, is_swa = 0 load_tensors: layer 15 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 16 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 17 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 18 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 19 assigned to device Vulkan0, is_swa = 0 load_tensors: layer 20 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 21 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 22 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 23 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 24 assigned to device Vulkan0, is_swa = 0 load_tensors: layer 25 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 26 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 27 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 28 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 29 assigned to device Vulkan0, is_swa = 0 load_tensors: layer 30 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 31 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 32 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 33 assigned to device Vulkan0, is_swa = 1 load_tensors: layer 34 assigned to device Vulkan0, is_swa = 0 load_tensors: layer 35 assigned to device Vulkan0, is_swa = 0
load_tensors: offloading output layer to GPU load_tensors: offloading 34 repeating layers to GPU load_tensors: offloaded 36/36 layers to GPU load_tensors: CPU_Mapped model buffer size = 1756.00 MiB load_tensors: Vulkan0 model buffer size = 1407.73 MiB .|.../.........-..........\..........|... common_init_result: added <eos> logit bias = -inf common_init_result: added <|tool_response> logit bias = -inf common_init_result: added <turn|> logit bias = -inf llama_context: constructing llama_context
|