$ python3 main.py --listen 0.0.0.0
setup plugin alembic.autogenerate.schemas
setup plugin alembic.autogenerate.tables
setup plugin alembic.autogenerate.types
setup plugin alembic.autogenerate.constraints
setup plugin alembic.autogenerate.defaults
setup plugin alembic.autogenerate.comments
WARNING: You need pytorch with cu130 or higher to use optimized CUDA operations.
Found comfy_kitchen backend cuda: {'available': True, 'disabled': True, 'unavailable_reason': None, 'capabilities': ['apply_rope', 'apply_rope1', 'dequantize_nvfp4', 'dequantize_per_tensor_fp8', 'quantize_mxfp8', 'quantize_nvfp4', 'quantize_per_tensor_fp8', 'scaled_mm_nvfp4']}
Found comfy_kitchen backend eager: {'available': True, 'disabled': False, 'unavailable_reason': None, 'capabilities': ['apply_rope', 'apply_rope1', 'dequantize_mxfp8', 'dequantize_nvfp4', 'dequantize_per_tensor_fp8', 'quantize_mxfp8', 'quantize_nvfp4', 'quantize_per_tensor_fp8', 'scaled_mm_mxfp8', 'scaled_mm_nvfp4']}
Found comfy_kitchen backend triton: {'available': True, 'disabled': True, 'unavailable_reason': None, 'capabilities': ['apply_rope', 'apply_rope1', 'dequantize_nvfp4', 'dequantize_per_tensor_fp8', 'quantize_mxfp8', 'quantize_nvfp4', 'quantize_per_tensor_fp8']}
Checkpoint files will always be loaded safely.
Total VRAM 11165 MB, total RAM 31755 MB
pytorch version: 2.7.1+cu118
Set vram state to: NORMAL_VRAM
Device: cuda:0 NVIDIA GeForce GTX 1080 Ti : cudaMallocAsync
Using async weight offloading with 2 streams
Enabled pinned memory 28579.0
Using pytorch attention
Unsupported Pytorch detected. DynamicVRAM support requires Pytorch version 2.8 or later. Falling back to legacy ModelPatcher.
VRAM estimates may be unreliable especially on Windows
Python version: 3.10.12 (main, Mar 3 2026, 11:56:32) [GCC 11.4.0]
ComfyUI version: 0.21.1
comfy-aimdo version: 0.3.0
comfy-kitchen version: 0.2.8
comfyui-frontend-package version: 1.43.18
comfyui-workflow-templates version: 0.9.77
comfyui-embedded-docs version: 0.5.0
comfy-kitchen version: 0.2.8
comfy-aimdo version: 0.3.0
[Prompt Server] web root: /home/minimonk/.local/lib/python3.10/site-packages/comfyui_frontend_package/static
Asset seeder disabled
Import times for custom nodes:
   0.0 seconds: /mnt/Downloads/ComfyUI/custom_nodes/websocket_image_save.py
Context impl SQLiteImpl.
Will assume non-transactional DDL.
Starting server
To see the GUI go to: http://0.0.0.0:8188
got prompt
Using pytorch attention in VAE
Using pytorch attention in VAE
VAE load device: cuda:0, offload device: cpu, dtype: torch.float32
Found quantization metadata version 1
Using MixedPrecisionOps for text encoder
Requested to load WanTEModel
loaded completely; 6419.48 MB loaded, full load: True
CLIP/text encoder model load device: cpu, offload device: cpu, current: cpu, dtype: torch.float16
Requested to load WanVAE
0 models unloaded.
loaded partially; 0.00 MB usable, 0.00 MB loaded, 484.00 MB offloaded, 45.57 MB buffer reserved, lowvram patches: 0
Found quantization metadata version 1
Detected mixed precision quantization
Using mixed precision operations
Native ops: , emulated ops: mxfp8, float8_e4m3fn, float8_e5m2, nvfp4
model weight dtype torch.float16, manual cast: torch.float32
model_type FLOW
Requested to load WAN21
0 models unloaded.
loaded partially; 0.00 MB usable, 0.00 MB loaded, 13636.09 MB offloaded, 885.22 MB buffer reserved, lowvram patches: 0
0%| | 0/10 [00:33<?, ?it/s]
!!! Exception during processing !!!
Allocation on device
Traceback (most recent call last):
  File "/mnt/Downloads/ComfyUI/execution.py", line 535, in execute
    output_data, output_ui, has_subgraph, has_pending_tasks = await get_output_data(prompt_id, unique_id, obj, input_data_all, execution_block_cb=execution_block_cb, pre_execute_cb=pre_execute_cb, v3_data=v3_data)
  File "/mnt/Downloads/ComfyUI/execution.py", line 335, in get_output_data
    return_values = await _async_map_node_over_list(prompt_id, unique_id, obj, input_data_all, obj.FUNCTION, allow_interrupt=True, execution_block_cb=execution_block_cb, pre_execute_cb=pre_execute_cb, v3_data=v3_data)
  File "/mnt/Downloads/ComfyUI/execution.py", line 309, in _async_map_node_over_list
    await process_inputs(input_dict, i)
  File "/mnt/Downloads/ComfyUI/execution.py", line 297, in process_inputs
    result = f(**inputs)
  File "/mnt/Downloads/ComfyUI/nodes.py", line 1612, in sample
    return common_ksampler(model, noise_seed, steps, cfg, sampler_name, scheduler, positive, negative, latent_image, denoise=denoise, disable_noise=disable_noise, start_step=start_at_step, last_step=end_at_step, force_full_denoise=force_full_denoise)
  File "/mnt/Downloads/ComfyUI/nodes.py", line 1542, in common_ksampler
    samples = comfy.sample.sample(model, noise, steps, cfg, sampler_name, scheduler, positive, negative, latent_image,
  File "/mnt/Downloads/ComfyUI/comfy/sample.py", line 74, in sample
    samples = sampler.sample(noise, positive, negative, cfg=cfg, latent_image=latent_image, start_step=start_step, last_step=last_step, force_full_denoise=force_full_denoise, denoise_mask=noise_mask, sigmas=sigmas, callback=callback, disable_pbar=disable_pbar, seed=seed)
  File "/mnt/Downloads/ComfyUI/comfy/samplers.py", line 1180, in sample
    return sample(self.model, noise, positive, negative, cfg, self.device, sampler, sigmas, self.model_options, latent_image=latent_image, denoise_mask=denoise_mask, callback=callback, disable_pbar=disable_pbar, seed=seed)
  File "/mnt/Downloads/ComfyUI/comfy/samplers.py", line 1070, in sample
    return cfg_guider.sample(noise, latent_image, sampler, sigmas, denoise_mask, callback, disable_pbar, seed)
  File "/mnt/Downloads/ComfyUI/comfy/samplers.py", line 1052, in sample
    output = executor.execute(noise, latent_image, sampler, sigmas, denoise_mask, callback, disable_pbar, seed, latent_shapes=latent_shapes)
  File "/mnt/Downloads/ComfyUI/comfy/patcher_extension.py", line 112, in execute
    return self.original(*args, **kwargs)
  File "/mnt/Downloads/ComfyUI/comfy/samplers.py", line 995, in outer_sample
    output = self.inner_sample(noise, latent_image, device, sampler, sigmas, denoise_mask, callback, disable_pbar, seed, latent_shapes=latent_shapes)
  File "/mnt/Downloads/ComfyUI/comfy/samplers.py", line 981, in inner_sample
    samples = executor.execute(self, sigmas, extra_args, callback, noise, latent_image, denoise_mask, disable_pbar)
  File "/mnt/Downloads/ComfyUI/comfy/patcher_extension.py", line 112, in execute
    return self.original(*args, **kwargs)
  File "/mnt/Downloads/ComfyUI/comfy/samplers.py", line 751, in sample
    samples = self.sampler_function(model_k, noise, sigmas, extra_args=extra_args, callback=k_callback, disable=disable_pbar, **self.extra_options)
  File "/home/minimonk/.local/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 116, in decorate_context
    return func(*args, **kwargs)
  File "/mnt/Downloads/ComfyUI/comfy/k_diffusion/sampling.py", line 205, in sample_euler
    denoised = model(x, sigma_hat * s_in, **extra_args)
  File "/mnt/Downloads/ComfyUI/comfy/samplers.py", line 400, in __call__
    out = self.inner_model(x, sigma, model_options=model_options, seed=seed)
  File "/mnt/Downloads/ComfyUI/comfy/samplers.py", line 954, in __call__
    return self.outer_predict_noise(*args, **kwargs)
  File "/mnt/Downloads/ComfyUI/comfy/samplers.py", line 961, in outer_predict_noise
    ).execute(x, timestep, model_options, seed)
  File "/mnt/Downloads/ComfyUI/comfy/patcher_extension.py", line 112, in execute
    return self.original(*args, **kwargs)
  File "/mnt/Downloads/ComfyUI/comfy/samplers.py", line 964, in predict_noise
    return sampling_function(self.inner_model, x, timestep, self.conds.get("negative", None), self.conds.get("positive", None), self.cfg, model_options=model_options, seed=seed)
  File "/mnt/Downloads/ComfyUI/comfy/samplers.py", line 380, in sampling_function
    out = calc_cond_batch(model, conds, x, timestep, model_options)
  File "/mnt/Downloads/ComfyUI/comfy/samplers.py", line 205, in calc_cond_batch
    return _calc_cond_batch_outer(model, conds, x_in, timestep, model_options)
  File "/mnt/Downloads/ComfyUI/comfy/samplers.py", line 213, in _calc_cond_batch_outer
    return executor.execute(model, conds, x_in, timestep, model_options)
  File "/mnt/Downloads/ComfyUI/comfy/patcher_extension.py", line 112, in execute
    return self.original(*args, **kwargs)
  File "/mnt/Downloads/ComfyUI/comfy/samplers.py", line 325, in _calc_cond_batch
    output = model.apply_model(input_x, timestep_, **c).chunk(batch_chunks)
  File "/mnt/Downloads/ComfyUI/comfy/model_base.py", line 182, in apply_model
    return comfy.patcher_extension.WrapperExecutor.new_class_executor(
  File "/mnt/Downloads/ComfyUI/comfy/patcher_extension.py", line 112, in execute
    return self.original(*args, **kwargs)
  File "/mnt/Downloads/ComfyUI/comfy/model_base.py", line 226, in _apply_model
    model_output = self.diffusion_model(xc, t, context=context, control=control, transformer_options=transformer_options, **extra_conds)
  File "/home/minimonk/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1751, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/home/minimonk/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1762, in _call_impl
    return forward_call(*args, **kwargs)
  File "/mnt/Downloads/ComfyUI/comfy/ldm/wan/model.py", line 644, in forward
    return comfy.patcher_extension.WrapperExecutor.new_class_executor(
  File "/mnt/Downloads/ComfyUI/comfy/patcher_extension.py", line 112, in execute
    return self.original(*args, **kwargs)
  File "/mnt/Downloads/ComfyUI/comfy/ldm/wan/model.py", line 664, in _forward
    return self.forward_orig(x, timestep, context, clip_fea=clip_fea, freqs=freqs, transformer_options=transformer_options, **kwargs)[:, :, :t, :h, :w]
  File "/mnt/Downloads/ComfyUI/comfy/ldm/wan/model.py", line 597, in forward_orig
    x = block(x, e=e0, freqs=freqs, context=context, context_img_len=context_img_len, transformer_options=transformer_options)
  File "/home/minimonk/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1751, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/home/minimonk/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1762, in _call_impl
    return forward_call(*args, **kwargs)
  File "/mnt/Downloads/ComfyUI/comfy/ldm/wan/model.py", line 258, in forward
    y = self.ffn(torch.addcmul(repeat_e(e[3], x), self.norm2(x), 1 + repeat_e(e[4], x)))
  File "/home/minimonk/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1751, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/home/minimonk/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1762, in _call_impl
    return forward_call(*args, **kwargs)
  File "/home/minimonk/.local/lib/python3.10/site-packages/torch/nn/modules/container.py", line 240, in forward
    input = module(input)
  File "/home/minimonk/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1751, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/home/minimonk/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1762, in _call_impl
    return forward_call(*args, **kwargs)
  File "/home/minimonk/.local/lib/python3.10/site-packages/torch/nn/modules/activation.py", line 734, in forward
    return F.gelu(input, approximate=self.approximate)
torch.OutOfMemoryError: Allocation on device
Memory summary:
|===========================================================================|
|                 PyTorch CUDA memory summary, device ID 0                  |
|---------------------------------------------------------------------------|
|            CUDA OOMs: 0            |        cudaMalloc retries: 0         |
|===========================================================================|
|        Metric         | Cur Usage  | Peak Usage | Tot Alloc  | Tot Freed  |
|---------------------------------------------------------------------------|
| Allocated memory      |   5675 MiB |   7030 MiB |        0 B |        0 B |
|       from large pool |      0 MiB |      0 MiB |        0 B |        0 B |
|       from small pool |      0 MiB |      0 MiB |        0 B |        0 B |
|---------------------------------------------------------------------------|
| Active memory         |   5675 MiB |   7030 MiB |        0 B |        0 B |
|       from large pool |      0 MiB |      0 MiB |        0 B |        0 B |
|       from small pool |      0 MiB |      0 MiB |        0 B |        0 B |
|---------------------------------------------------------------------------|
| Requested memory      |        0 B |        0 B |        0 B |        0 B |
|       from large pool |        0 B |        0 B |        0 B |        0 B |
|       from small pool |        0 B |        0 B |        0 B |        0 B |
|---------------------------------------------------------------------------|
| GPU reserved memory   |  10784 MiB |  10784 MiB |        0 B |        0 B |
|       from large pool |      0 MiB |      0 MiB |        0 B |        0 B |
|       from small pool |      0 MiB |      0 MiB |        0 B |        0 B |
|---------------------------------------------------------------------------|
| Non-releasable memory |        0 B |        0 B |        0 B |        0 B |
|       from large pool |        0 B |        0 B |        0 B |        0 B |
|       from small pool |        0 B |        0 B |        0 B |        0 B |
|---------------------------------------------------------------------------|
| Allocations           |          0 |          0 |          0 |          0 |
|       from large pool |          0 |          0 |          0 |          0 |
|       from small pool |          0 |          0 |          0 |          0 |
|---------------------------------------------------------------------------|
| Active allocs         |          0 |          0 |          0 |          0 |
|       from large pool |          0 |          0 |          0 |          0 |
|       from small pool |          0 |          0 |          0 |          0 |
|---------------------------------------------------------------------------|
| GPU reserved segments |          0 |          0 |          0 |          0 |
|       from large pool |          0 |          0 |          0 |          0 |
|       from small pool |          0 |          0 |          0 |          0 |
|---------------------------------------------------------------------------|
| Non-releasable allocs |          0 |          0 |          0 |          0 |
|       from large pool |          0 |          0 |          0 |          0 |
|       from small pool |          0 |          0 |          0 |          0 |
|---------------------------------------------------------------------------|
| Oversize allocations  |          0 |          0 |          0 |          0 |
|---------------------------------------------------------------------------|
| Oversize GPU segments |          0 |          0 |          0 |          0 |
|===========================================================================|
Got an OOM, unloading all loaded models.
Prompt executed in 154.15 seconds