# CheckpointLoaderSimpleDisTorch2MultiGPU

The `CheckpointLoaderSimpleDisTorch2MultiGPU` node loads checkpoint models (complete diffusion models containing UNet, CLIP, and VAE components) with DisTorch2 distributed tensor allocation, enabling multi-device VRAM management so that larger models can be spread across multiple GPUs.

This node automatically detects models located in the `ComfyUI/models/checkpoints` folder, and it will also read models from additional paths configured in the `extra_model_paths.yaml` file. Sometimes you may need to **refresh the ComfyUI interface** for it to pick up newly added model files.

## Inputs

| Parameter | Data Type | Description |
| --- | --- | --- |
| `ckpt_name` | `STRING` | The name of the checkpoint model to load. |
| `compute_device` | `STRING` | Target device for compute operations (e.g. `cuda:0`, `cuda:1`, `cpu`), selected from the devices available on your system. |
| `virtual_vram_gb` | `FLOAT` | Amount of virtual VRAM, in gigabytes, to allocate for distributed tensor management (default: `4.0`, range: `0.0`–`128.0`). |
| `donor_device` | `STRING` | Device that donates memory when virtual VRAM is allocated (default: `cpu`). |
| `expert_mode_allocations` | `STRING` | Advanced allocation string for manually specifying per-device distributions (e.g. `cuda:0,50%;cpu,*`). |
| `keep_loaded` | `BOOLEAN` | Whether to keep the model loaded when memory cleanup is triggered (default: `true`). |

## Outputs

| Output Name | Data Type | Description |
| --- | --- | --- |
| `MODEL` | `MODEL` | The loaded UNet diffusion model with DisTorch2 distributed allocation applied. |
| `CLIP` | `CLIP` | The loaded CLIP text encoder model. |
| `VAE` | `VAE` | The loaded VAE decoder/encoder model. |

## DisTorch2 Distributed Loading

DisTorch2 is an advanced memory management system that enables loading and running large diffusion models across multiple GPUs by intelligently distributing tensor allocations. Instead of loading an entire model on a single device, DisTorch2 splits the model's layers across available devices while maintaining computational efficiency.
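
The layer-splitting idea can be sketched in plain Python. This is an illustrative approximation, not DisTorch2's actual algorithm; the function and variable names are hypothetical:

```python
# Illustrative sketch (not DisTorch2's implementation): assign a model's
# layers to devices according to per-device fractions, keeping contiguous
# runs of layers on the same device.

def assign_layers(num_layers, fractions):
    """fractions: ordered mapping of device -> share of the model (sums to 1)."""
    assignments = {}
    start = 0
    devices = list(fractions.items())
    for i, (device, frac) in enumerate(devices):
        # The last device absorbs any rounding remainder.
        end = num_layers if i == len(devices) - 1 else start + round(num_layers * frac)
        assignments[device] = list(range(start, end))
        start = end
    return assignments

plan = assign_layers(20, {"cuda:0": 0.6, "cuda:1": 0.3, "cpu": 0.1})
# e.g. layers 0-11 on cuda:0, 12-17 on cuda:1, 18-19 on cpu
```

At inference time, each layer's weights would then live on its assigned device and be moved (or streamed) to the compute device as needed; that transfer logic is what the real system optimizes.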

### Key Concepts

**Virtual VRAM Allocation**: Artificially increases the available VRAM on the compute device by borrowing memory capacity from donor devices through intelligent tensor distribution.

**Expert Mode Allocations**: Advanced users can manually specify exactly how much of the model is placed on each device using ratio- or byte-based allocation strings.

### Allocation Examples

**Basic Virtual VRAM Mode**:
- `compute_device`: `cuda:0`
- `virtual_vram_gb`: `8.0`
- `donor_device`: `cuda:1`
- Result: loads the model as if `cuda:0` had 8 GB more VRAM available, using `cuda:1` as the memory donor.
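
The arithmetic behind this mode can be illustrated with hypothetical numbers (the 12 GB card and 16 GB model below are assumptions for the example, not values from the node):

```python
# Virtual VRAM math for a hypothetical 16 GB model on a 12 GB cuda:0,
# with 8 GB of virtual VRAM donated by cuda:1.
physical_vram_gb = 12.0  # assumed cuda:0 capacity
virtual_vram_gb = 8.0    # the node's virtual_vram_gb input
effective_gb = physical_vram_gb + virtual_vram_gb  # 20.0 GB apparent capacity

model_gb = 16.0
fits = model_gb <= effective_gb                       # True: 16 <= 20
on_donor_gb = max(0.0, model_gb - physical_vram_gb)   # 4.0 GB spills to cuda:1
```

In other words, the donor device holds whatever portion of the model exceeds the compute device's physical capacity.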

**Expert Ratio Allocation**:
- `expert_mode_allocations`: `cuda:0,60%;cuda:1,30%;cpu,10%`
- Distributes model layers with 60% on GPU 0, 30% on GPU 1, and 10% on the CPU.

**Expert Byte Allocation**:
- `expert_mode_allocations`: `cuda:0,4gb;cuda:1,2gb;cpu,*`
- Allocates exactly 4 GB to `cuda:0`, 2 GB to `cuda:1`, and the remainder to the CPU.
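
A small parser makes the allocation-string grammar shown above concrete: semicolon-separated `device,share` pairs, where a share is a percentage (`60%`), a byte size (`4gb`), or `*` for the remainder. This is a sketch of the documented format only; the tuple representation is illustrative, not DisTorch2's internal data structure:

```python
# Parse an expert_mode_allocations string such as "cuda:0,4gb;cuda:1,2gb;cpu,*".
def parse_allocations(spec):
    allocations = []
    for entry in spec.split(";"):
        device, share = entry.split(",")
        if share == "*":
            allocations.append((device, "remainder", None))   # takes whatever is left
        elif share.endswith("%"):
            allocations.append((device, "percent", float(share[:-1])))
        elif share.lower().endswith("gb"):
            allocations.append((device, "gigabytes", float(share[:-2])))
        else:
            raise ValueError(f"unrecognized share: {share!r}")
    return allocations

parse_allocations("cuda:0,4gb;cuda:1,2gb;cpu,*")
# [('cuda:0', 'gigabytes', 4.0), ('cuda:1', 'gigabytes', 2.0), ('cpu', 'remainder', None)]
```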

**Mixed Mode**:
Combines virtual VRAM with expert allocations for complex multi-device scenarios.