
Commit fd60ae6

Add MultiGPU support for various model loaders and encoders
1 parent 01b41f4 commit fd60ae6

25 files changed

Lines changed: 648 additions & 0 deletions
# CheckpointLoaderNF4MultiGPU

`CheckpointLoaderNF4MultiGPU` wraps the NF4 checkpoint loader from `ComfyUI_bitsandbytes_NF4` so you can pick the execution device when working with 4-bit quantised diffusion checkpoints.

## Inputs

All base parameters from `CheckpointLoaderNF4` are retained. The MultiGPU wrapper adds one optional field:

| Parameter | Data Type | Description |
| --- | --- | --- |
| `device` | `STRING` | Device that should own the loaded NF4 checkpoint (GPU id or `cpu`). |

## Outputs

Outputs are identical to the upstream NF4 loader (UNet/CLIP/VAE tuple). The only behavioural change is the explicit device placement.
# DownloadAndLoadFlorence2ModelMultiGPU

`DownloadAndLoadFlorence2ModelMultiGPU` mirrors the download-and-load helper supplied by `ComfyUI-Florence2`, but with explicit device and offload selection so large Florence2 checkpoints can live on secondary GPUs or in CPU memory.

## Inputs

All original inputs from `DownloadAndLoadFlorence2Model` remain available. The MultiGPU wrapper introduces two optional selectors:

| Parameter | Data Type | Description |
| --- | --- | --- |
| `device` | `STRING` | Compute device to host the model once loaded. |
| `offload_device` | `STRING` | Device that receives automatic offloads (defaults to `cpu`). |

## Outputs

Outputs match the base Florence2 helper (model handle plus aux data). The only difference is that the returned model is already resident on the device you specified.
# DownloadAndLoadWav2VecModelMultiGPU

`DownloadAndLoadWav2VecModelMultiGPU` downloads a preset Wav2Vec2 checkpoint from Hugging Face (if missing) and loads it onto the device you choose, mirroring WanVideo's helper while adding MultiGPU awareness.

## Inputs

### Required

| Parameter | Data Type | Description |
| --- | --- | --- |
| `model` | `STRING` | Preset identifier (`TencentGameMate/chinese-wav2vec2-base` or `facebook/wav2vec2-base-960h`). |
| `base_precision` | `STRING` | Weight precision (`fp32`, `bf16`, `fp16`). |
| `load_device` | `STRING` | Wan loader slot (`main_device` or `offload_device`). |
| `device` | `STRING` | MultiGPU device to run the audio model. |

## Outputs

| Output Name | Data Type | Description |
| --- | --- | --- |
| `wav2vec_model` | `WAV2VECMODEL` | Downloaded and loaded Wav2Vec2 model. |
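The `base_precision` string has to be resolved into an actual dtype before the weights are loaded. A hedged sketch of that step, with the function name and mapping chosen for illustration (a real loader would map to `torch` dtypes rather than plain names):

```python
# Assumed mapping from the documented precision strings to dtype names.
PRECISION_DTYPES = {
    "fp32": "float32",
    "bf16": "bfloat16",
    "fp16": "float16",
}

def resolve_dtype(base_precision: str) -> str:
    """Return the dtype name for a precision string, rejecting unknown values."""
    try:
        return PRECISION_DTYPES[base_precision]
    except KeyError:
        raise ValueError(f"unsupported precision: {base_precision!r}")
```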
# FantasyTalkingModelLoaderMultiGPU

`FantasyTalkingModelLoaderMultiGPU` loads FantasyTalking diffusion models with explicit device control, making it easier to keep speech animation workloads off your primary compute GPU.

## Inputs

### Required

| Parameter | Data Type | Description |
| --- | --- | --- |
| `model` | `STRING` | FantasyTalking model from `ComfyUI/models/diffusion_models`. |
| `base_precision` | `STRING` | Precision for the weights (`fp32`, `bf16`, `fp16`). |
| `device` | `STRING` | MultiGPU device that should host the model. |

## Outputs

| Output Name | Data Type | Description |
| --- | --- | --- |
| `model` | `FANTASYTALKINGMODEL` | Loaded FantasyTalking model bundle. |
# Florence2ModelLoaderMultiGPU

`Florence2ModelLoaderMultiGPU` wraps the Florence2 model loader so you can decide which device handles model inference and which device receives Wan/Comfy offloads. Use it exactly like the original node from `ComfyUI-Florence2`; all native inputs remain available.

## Inputs

All parameters from `Florence2ModelLoader` are still supported. The MultiGPU variant adds the following optional fields:

| Parameter | Data Type | Description |
| --- | --- | --- |
| `device` | `STRING` | MultiGPU device used for runtime compute (`cuda:0`, `cuda:1`, `cpu`, etc.). |
| `offload_device` | `STRING` | Device that receives automatic model offloads (defaults to `cpu`). |

## Outputs

The outputs are identical to the upstream Florence2 loader (model tuple, additional metadata). Use them interchangeably in existing workflows; only the device placement behaviour changes.
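The compute/offload split that `device` and `offload_device` describe can be illustrated with a small state holder. The class and method names below are hypothetical; a real node would move a torch module with `.to(device)` instead of tracking a string:

```python
class DevicePlacement:
    """Illustrative sketch: track where a model currently lives."""

    def __init__(self, device: str, offload_device: str = "cpu"):
        self.device = device                 # runtime compute device
        self.offload_device = offload_device # where idle weights go
        self.current = offload_device        # models start offloaded

    def to_compute(self) -> str:
        """Bring the model onto the compute device before inference."""
        self.current = self.device
        return self.current

    def offload(self) -> str:
        """Return the model to the offload device after inference."""
        self.current = self.offload_device
        return self.current
```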

web/docs/LTXVLoaderMultiGPU.md

# LTXVLoaderMultiGPU

`LTXVLoaderMultiGPU` wraps `ComfyUI-LTXVideo`'s checkpoint loader so you can push LTX Video models to any GPU (or CPU) in your system without editing the base node.

## Inputs

Every input from the upstream `LTXVLoader` node is preserved. The MultiGPU version adds a single optional selector:

| Parameter | Data Type | Description |
| --- | --- | --- |
| `device` | `STRING` | MultiGPU device that should host the loaded LTX Video checkpoint. |

## Outputs

Outputs are identical to the original LTX Video loader. The loader simply ensures the returned model already resides on the selected device.
# LoadFluxControlNetMultiGPU

`LoadFluxControlNetMultiGPU` exposes device selection for XLabs-AI's FLUX ControlNet loader, letting you keep the ControlNet on a secondary GPU or the CPU while the main FLUX UNet stays on your primary compute device.

## Inputs

All inputs from the upstream `LoadFluxControlNet` node remain unchanged. The MultiGPU variant introduces one optional field:

| Parameter | Data Type | Description |
| --- | --- | --- |
| `device` | `STRING` | MultiGPU device that will host the ControlNet during inference. |

## Outputs

Outputs match the base FLUX ControlNet loader exactly; only the device placement differs.
# LoadWanVideoClipTextEncoderMultiGPU

`LoadWanVideoClipTextEncoderMultiGPU` loads WanVideo CLIP vision/text encoders on the device you specify, making it easy to keep encoders off your primary compute GPU when memory is tight.

## Inputs

### Required

| Parameter | Data Type | Description |
| --- | --- | --- |
| `model_name` | `STRING` | CLIP vision or text encoder model from `ComfyUI/models/clip_vision` or `ComfyUI/models/text_encoders`. |
| `precision` | `STRING` | Weight precision for the model (`fp16`, `fp32`, or `bf16`). |

### Optional

| Parameter | Data Type | Description |
| --- | --- | --- |
| `device` | `STRING` | Target MultiGPU device to host the encoder. |

## Outputs

| Output Name | Data Type | Description |
| --- | --- | --- |
| `wan_clip_vision` | `CLIP_VISION` | Loaded CLIP vision/text module ready for image conditioning. |
| `load_device` | `MULTIGPUDEVICE` | Device that now owns the encoder; feed into `WanVideoClipVisionEncode`. |
# LoadWanVideoT5TextEncoderMultiGPU

`LoadWanVideoT5TextEncoderMultiGPU` loads WanVideo T5 text encoders while letting you choose the MultiGPU device used for embedding work. The node returns both the encoder handle and the device string so downstream text nodes inherit placement automatically.

## Inputs

### Required

| Parameter | Data Type | Description |
| --- | --- | --- |
| `model_name` | `STRING` | T5 model from `ComfyUI/models/text_encoders`. |
| `precision` | `STRING` | Base precision for the encoder (`fp32` or `bf16`). |

### Optional

| Parameter | Data Type | Description |
| --- | --- | --- |
| `device` | `STRING` | MultiGPU device (defaults to secondary GPU when available). |
| `quantization` | `STRING` | Enable FP8 quantisation (`fp8_e4m3fn`) when supported. |

## Outputs

| Output Name | Data Type | Description |
| --- | --- | --- |
| `wan_t5_model` | `WANTEXTENCODER` | Loaded Wan T5 encoder bundle. |
| `load_device` | `MULTIGPUDEVICE` | Device string to reuse with `WanVideoTextEncode*` nodes. |
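The reason the node emits `load_device` alongside the encoder is so downstream text-encode nodes can inherit placement without the user re-selecting a device. A minimal sketch of that handshake, with all function names and the model identifier invented for illustration (the dicts stand in for real model objects):

```python
def load_t5(model_name: str, device: str = "cuda:1"):
    """Return (encoder, load_device), mirroring the node's two outputs."""
    encoder = {"name": model_name, "device": device}  # placeholder for the real T5 bundle
    return encoder, device

def text_encode(prompt: str, encoder: dict, load_device: str) -> dict:
    """Downstream node: runs on whatever device the loader chose."""
    return {"prompt": prompt, "device": load_device}

# Wire the loader's second output straight into the encode node:
enc, dev = load_t5("example-t5-checkpoint.safetensors")
embedding = text_encode("a cat", enc, dev)
```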
# WanVideoBlockSwapMultiGPU

`WanVideoBlockSwapMultiGPU` prepares block swap arguments for WanVideo models and adds an explicit `swap_device` selector so you can decide which device receives swapped transformer blocks.

## Inputs

| Parameter | Data Type | Description |
| --- | --- | --- |
| *(base Wan block swap inputs)* | *varies* | All parameters exposed by the upstream `WanVideoBlockSwap` node are available and behave identically. |
| `swap_device` | `STRING` | Additional MultiGPU device option that picks the destination for swapped layers (`cpu`, `cuda:1`, etc.). |

## Outputs

| Output Name | Data Type | Description |
| --- | --- | --- |
| `block_swap_args` | `BLOCKSWAPARGS` | Configuration dictionary to feed into `WanVideoModelLoaderMultiGPU` or Wan samplers. |
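Since the node's output is a configuration dictionary, its role can be sketched as a plain constructor. The exact keys inside `BLOCKSWAPARGS` are an assumption here; only `swap_device` is documented above as the MultiGPU addition:

```python
def make_block_swap_args(blocks_to_swap: int, swap_device: str = "cpu") -> dict:
    """Build a block-swap config dict (illustrative keys, not the real schema)."""
    if blocks_to_swap < 0:
        raise ValueError("blocks_to_swap must be non-negative")
    return {
        "blocks_to_swap": blocks_to_swap,  # how many transformer blocks to evict
        "swap_device": swap_device,        # where evicted blocks are parked
    }
```

The resulting dict would then be wired into `WanVideoModelLoaderMultiGPU` or a Wan sampler as the `block_swap_args` input.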
