zho-zho-zho

Open Source

ComfyUI-InstantID

![ISID_](https://github.com/ZHO-ZHO-ZHO/ComfyUI-InstantID/assets/140084057/01393483-3145-4691-9daa-7ce9035c9bd0) # ComfyUI InstantID Unofficial implementation of [InstantID](https://github.com/InstantID/InstantID) for ComfyUI ![Dingtalk_20240123182131](https://github.com/ZHO-ZHO-ZHO/ComfyUI-InstantID/assets/140084057/7a99b32c-b4a2-4c46-acb0-f796fc46f9ee) + pose_ref ![Dingtalk_20240124232946](https://github.com/ZHO-ZHO-ZHO/ComfyUI-InstantID/assets/140084057/caa60456-f2d8-4315-864b-659a9e7cea89) ## 项目介绍 | Info - 来自对[InstantID](https://github.com/InstantID/InstantID)的非官方实现 - 版本：V2.0 支持姿势参考图  ## 视频演示 V2.0 https://github.com/ZHO-ZHO-ZHO/ComfyUI-InstantID/assets/140084057/083c9e5e-06a0-4623-b5ac-05f7e85a74f2 V1.0 https://github.com/ZHO-ZHO-ZHO/ComfyUI-InstantID/assets/140084057/7295c0d7-1d1b-4044-aea3-8efa67047362 ## 节点说明 | Features - 基础模型加载 | base model loader - 📷ID Base Model Loader from hub 🤗：支持从 huggingface hub 自动下载模型，输入模型名称（如：wangqixun/YamerMIX_v8）即可 - 📷ID Base Model Loader locally：支持加载本地模型（需 SDXL 系列模型） - InsightFace 模型加载 | 📷InsightFace Loader - ：支持 CUDA 和 CPU - ID ControlNet 模型加载 | 📷ID ControlNet Loader - controlnet_path：ID ControlNet 模型地址 - Ipadapter_instantid 模型加载 | 📷Ipadapter_instantid Loader - Ipadapter_instantid_path：模型路径 - filename：模型名称 - 提示词 + 风格 | 📷ID Prompt_Styler - 与各种提示词（文本）输入（如肖像大师等）、styler、 Photomaker Prompt_Styler 兼容 - prompt、negative：正负提示词 - style_name：支持官方提供的8种风格 - (No style) - Watercolor - Film Noir - Neon - Jungle - Mars - Vibrant Color - Snow - Line art - InstantID 生成 | 📷InstantID Generation 🆕 - face_image：接入脸部参考图像 - pipe：接入模型 - insightface：接入 insightface 模型 🆕 - pose_image_optional（非必要）：接入姿势参考图像（注意：仅对面部周围姿势起效，与通常的 openpose 不同） - positivet、negative：正负提示词 - ip_adapter_scale：IPA 强度 - controlnet_conditioning_scale：ID Controlnet 强度 - step：步数，官方默认30步 - guidance_scale：提示词相关度，一般默认为5 - enhance_face_region：脸部增强选项 🆕 - seed：种子 ## 风格 | Styles ![ISID_STYLE](https://github.com/ZHO-ZHO-ZHO/ComfyUI-InstantID/assets/140084057/142bda7a-798b-46b3-aa69-1b88701c8311) ## 安装 | Install - 推荐使用管理器 ComfyUI Manager 安装（On the Way） - 手动安装： 1. `cd custom_nodes` 2. `git clone https://github.com/ZHO-ZHO-ZHO/ComfyUI-InstantID.git` 3. `cd custom_nodes/ComfyUI-InstantID` 4. `pip install -r requirements.txt` 5. 重启 ComfyUI ## 使用方法 | How to Use - 下载 [InstantID/ControlNetModel](https://huggingface.co/InstantX/InstantID/tree/main/ControlNetModel) 中的 config.json 和 diffusion_pytorch_model.safetensors ，将模型地址填入 📷ID ControlNet Loader 节点中（例如：ComfyUI/custom_nodes/ComfyUI-InstantID/checkpoints/controlnet） - 下载 [InstantID/ip-adapter](https://huggingface.co/InstantX/InstantID/tree/main) 中的 ip-adapter.bin ，将其地址填入 📷Ipadapter_instantid Loader 节点中（例如：ComfyUI/custom_nodes/ComfyUI-InstantID/checkpoints） - 下载 [DIAMONIK7777/antelopev2](https://huggingface.co/DIAMONIK7777/antelopev2/tree/main) 中的所有模型，将其放入 ComfyUI//custom_nodes/ComfyUI-InstantID/models/antelopev2 中 - 兼容性： CUDA11 支持默认安装的 onnxruntime-gpu（1.16.0），如果是 CUDA12 则需手动安装 onnxruntime-gpu==1.17.0 [地址](https://dev.azure.com/onnxruntime/onnxruntime/_artifacts/feed/onnxruntime-cuda-12/PyPI/onnxruntime-gpu/overview/1.17.0) ## 工作流 | Workflows V2.0 - [V2.0 InstantID_pose_ref + ArtGallery](https://github.com/ZHO-ZHO-ZHO/ComfyUI-InstantID/blob/main/INSTANTID%20WORKFLOWS/V2.0%20InstantID_pose_ref%20%2B%20ArtGallery%20%E3%80%90Zho%E3%80%91.json) ![Dingtalk_20240124232833](https://github.com/ZHO-ZHO-ZHO/ComfyUI-InstantID/assets/140084057/99be9592-775d-4c33-bafc-5bd5c95a7222) - [V2.0 自动下载 huggingface hub](https://github.com/ZHO-ZHO-ZHO/ComfyUI-InstantID/blob/main/INSTANTID%20WORKFLOWS/V2.0%20InstantID_fromhub_pose_ref%E3%80%90Zho%E3%80%91.json) ![Dingtalk_20240124230145](https://github.com/ZHO-ZHO-ZHO/ComfyUI-InstantID/assets/140084057/95c4a1dd-864d-4a46-8c45-a48866aef29f) - [V2.0 InstantID_locally_pose_ref](https://github.com/ZHO-ZHO-ZHO/ComfyUI-InstantID/blob/main/INSTANTID%20WORKFLOWS/V2.0%20InstantID_locally_pose_ref%E3%80%90Zho%E3%80%91.json) ![Dingtalk_20240124230609](https://github.com/ZHO-ZHO-ZHO/ComfyUI-InstantID/assets/140084057/d4c22389-f853-44bd-9ea2-568b2ac7ed06) V1.0 工作流仅适用于V1.0 版本 - [V1.0 InstantID + ArtGallery](https://github.com/ZHO-ZHO-ZHO/ComfyUI-InstantID/blob/main/INSTANTID%20WORKFLOWS/V1.0%20InstantID%20%2B%20ArtGallery%E3%80%90Zho%E3%80%91.json) ![Dingtalk_20240123182440](https://github.com/ZHO-ZHO-ZHO/ComfyUI-InstantID/assets/140084057/c6ee25bf-a528-4d78-9b35-f5b0d0303601) - [V1.0 本地模型 locally](https://github.com/ZHO-ZHO-ZHO/ComfyUI-InstantID/blob/main/INSTANTID%20WORKFLOWS/V1.0%20InstantID_locally%E3%80%90Zho%E3%80%91.json) ![Dingtalk_20240123175624](https://github.com/ZHO-ZHO-ZHO/ComfyUI-InstantID/assets/140084057/459bfede-59e8-4d8d-941c-a950c4827c49) - [V1.0 自动下载 huggingface hub](https://github.com/ZHO-ZHO-ZHO/ComfyUI-InstantID/blob/main/INSTANTID%20WORKFLOWS/V1.0%20InstantID_fromhub%E3%80%90Zho%E3%80%91.json) ![Dingtalk_20240123174950](https://github.com/ZHO-ZHO-ZHO/ComfyUI-InstantID/assets/140084057/50133961-1752-4ec8-ac0b-068d998b8534) ## 更新日志 - 20240124 更新为 V2.0 ：新增姿势参考图、优化代码修复 insightfaceloader 冲突问题修复 onnxruntime-gpu 版本兼容性的问题 - 20240123 V1.0 上线：同时支持本地、huggingface hub托管模型，支持8种风格 - 20240122 创建项目 ## 速度实测 | Speed - V1.0 - A100 50步 14s ![image](https://github.com/ZHO-ZHO-ZHO/ComfyUI-InstantID/assets/140084057/dc535e67-3f56-4faf-be81-621b84bb6ee2) ## Stars [![Star History Chart](https://api.star-history.com/svg?repos=ZHO-ZHO-ZHO/ComfyUI-InstantID&type=Date)](https://star-history.com/#ZHO-ZHO-ZHO/ComfyUI-InstantID&Date) ## 关于我 | About me 📬 **联系我**： - 邮箱：[email protected] - QQ 群：839821928 🔗 **社交媒体**： - 个人页：[-Zho-](https://jike.city/zho) - Bilibili：[我的B站主页](https://space.bilibili.com/484366804) - X（Twitter）：[我的Twitter](https://twitter.com/ZHOZHO672070) - 小红书：[我的小红书主页](https://www.xiaohongshu.com/user/profile/63f11530000000001001e0c8?xhsshare=CopyLink&appuid=63f11530000000001001e0c8&apptime=1690528872) 💡 **支持我**： - B站：[B站充电](https://space.bilibili.com/484366804) - 爱发电：[为我充电](https://afdian.net/a/ZHOZHO) ## Credits [InstantID](https://github.com/InstantID/InstantID) 📷InsightFace Loader 代码修改自 [ComfyUI_IPAdapter_plus](https://github.com/cubiq/ComfyUI_IPAdapter_plus)，感谢 [@cubiq](https://github.com/cubiq)！感谢 [@hidecloud](https://twitter.com/hidecloud) 对 onnxruntime 版本兼容性的测试与反馈！感谢 [esheep](https://www.esheep.com/) 技术人员对节点冲突问题的反馈！

AI Tools

1.4K Github Stars

Open Source

ComfyUI-BRIA_AI-RMBG

![BR_](https://github.com/ZHO-ZHO-ZHO/ComfyUI-BRIA_AI-RMBG/assets/140084057/c8167676-c347-468a-a719-aee3c4c69310) # ComfyUI-BRIA_AI-RMBG Unofficial [BRIA Background Removal v1.4](https://huggingface.co/briaai/RMBG-1.4) of BRIA RMBG Model for ComfyUI ![Dingtalk_20240207145631](https://github.com/ZHO-ZHO-ZHO/ComfyUI-BRIA_AI-RMBG/assets/140084057/f22fcbc4-f223-44be-bbcd-93e2e55937de) ## 项目介绍 | Info - 对[BRIA Background Removal v1.4](https://huggingface.co/briaai/RMBG-1.4)的非官方实现 - BRIA Background Removal v1.4：由 BRIA AI 开发，可作为非商业用途的开源模型 - 版本：**V1.5** 支持批量处理（可去除视频背景）、新增输出 mask 功能 ## 视频演示 SVD1.1 + RMBG 1.4 = https://github.com/ZHO-ZHO-ZHO/ComfyUI-BRIA_AI-RMBG/assets/140084057/fdae7655-bcd0-4250-9d9b-a88b59f80d43 ## 安装 | Install - 推荐使用管理器 ComfyUI Manager 安装（On the Way） - 手动安装： 1. `cd custom_nodes` 2. `git clone https://github.com/ZHO-ZHO-ZHO/ComfyUI-BRIA_AI-RMBG.git` 3. 重启 ComfyUI ## 使用说明 | How to Use - 将 [Removal v1.4](https://huggingface.co/briaai/RMBG-1.4) 模型下载至`/custom_nodes/ComfyUI-BRIA_AI-RMBG/RMBG-1.4` - 节点： ![Dingtalk_20240207154339](https://github.com/ZHO-ZHO-ZHO/ComfyUI-BRIA_AI-RMBG/assets/140084057/70b9089e-81e8-4853-8564-83145f881175) - 🧹BRIA_RMBG Model Loader：自动加载 Removal v1.4 模型 - 🧹BRIA RMBG：去除背景 ## 更新日志 - 20240207 V1.5 支持批量处理、新增输出 mask 功能创建项目 V1.0 ## Stars [![Star History Chart](https://api.star-history.com/svg?repos=ZHO-ZHO-ZHO/ComfyUI-BRIA_AI-RMBG&type=Date)](https://star-history.com/#ZHO-ZHO-ZHO/ComfyUI-BRIA_AI-RMBG&Date) ## 关于我 | About me 📬 **联系我**： - 邮箱：[email protected] - QQ 群：839821928 🔗 **社交媒体**： - 个人页：[-Zho-](https://jike.city/zho) - Bilibili：[我的B站主页](https://space.bilibili.com/484366804) - X（Twitter）：[我的Twitter](https://twitter.com/ZHOZHO672070) - 小红书：[我的小红书主页](https://www.xiaohongshu.com/user/profile/63f11530000000001001e0c8?xhsshare=CopyLink&appuid=63f11530000000001001e0c8&apptime=1690528872) 💡 **支持我**： - B站：[B站充电](https://space.bilibili.com/484366804) - 爱发电：[为我充电](https://afdian.net/a/ZHOZHO) ## Credits [BRIA Background Removal v1.4](https://huggingface.co/briaai/RMBG-1.4) 代码参考了 [@camenduru](https://twitter.com/camenduru) 感谢！

AI & Machine Learning LLM Tools & Chat UIs

832 Github Stars

Open Source

ComfyUI-PhotoMaker-ZHO

![PNSTYLE_23png](https://github.com/ZHO-ZHO-ZHO/ComfyUI-PhotoMaker/assets/140084057/15f9ebaf-b205-4cbd-928e-eca1a0cacb7f) # ComfyUI PhotoMaker Unofficial implementation of [PhotoMaker](https://github.com/TencentARC/PhotoMaker) for ComfyUI  ![Dingtalk_20240118163802](https://github.com/ZHO-ZHO-ZHO/ComfyUI-PhotoMaker/assets/140084057/0292bf55-21b7-4025-bc27-7e3e7ccc2af3)  单张参考与多张参考的对比： ![Dingtalk_20240117201650](https://github.com/ZHO-ZHO-ZHO/ComfyUI-PhotoMaker/assets/140084057/e7bccd61-7855-46c2-a6bc-31b34e742927) ![Dingtalk_20240117201201](https://github.com/ZHO-ZHO-ZHO/ComfyUI-PhotoMaker/assets/140084057/6bbcfcf9-9027-4c6f-9be1-750971b7848c) ## 项目介绍 | Info - 来自对[PhotoMaker](https://github.com/TencentARC/PhotoMaker)的非官方实现 - 版本：V2.5 支持lora、支持多批次、支持通用的styler  ![Dingtalk_20240119194547](https://github.com/ZHO-ZHO-ZHO/ComfyUI-PhotoMaker/assets/140084057/d067fc21-3b51-44bc-b76e-9351a7f6966a) ## 视频演示  https://github.com/ZHO-ZHO-ZHO/ComfyUI-PhotoMaker/assets/140084057/d58af6e7-d0f3-41ff-ab33-195cb6d66e9e ## 节点说明 | Features - 基础模型加载 | base model loader - 📷Base Model Loader from hub🤗：支持从 huggingface hub 自动下载模型，输入模型名称（如：SG161222/RealVisXL_V3.0）即可 - 📷Base Model Loader locally：支持加载本地模型（需 SDXL 系列模型） - PhotoMaker Adapter 模型加载 | PhotoMaker Adapter Loader - 📷PhotoMaker Adapter Loader from hub🤗：支持从 huggingface hub 自动下载模型 - 📷PhotoMaker Adapter Loader locally：支持加载本地模型，输入 photomaker-v1.bin 模型所在路径即可 - 参考图预处理 | 📷Ref Image Preprocessing - 直接模式 | Direct_input：接入单/多张图像（非必要项） - 路径模式 | Path_input：自动读取路径中的所有图像 - Lora模型加载 | 📷LoRALoader 🆕 - 支持加载本地 lora 模型 - 支持权重调节 - 提示词 + 风格 | 📷Prompt_Styler 🆕 - 与各种提示词（文本）输入（如肖像大师等）、styler兼容 - prompt、negative：正负提示词 - 支持权重调节 - style_name：支持官方提供的10种风格 - (No style) - Cinematic - Disney Charactor - Digital Art - Photographic (Default) - Fantasy art - Neonpunk - Enhance - Comic book - Lowpoly - Line art - PhotoMaker 生成 | 📷PhotoMaker Generation 🆕 - pipe：接入模型 - pil_image：接入预处理图像 - positivet、negative：正负提示词 - batch_size：生成数量 - style_strength_ratio：风格混合强度（高于30按30计算） - step：步数，官方默认50步，但毕竟是基于SDXL模型，我实测下来30步足够了 - guidance_scale：提示词相关度，一般默认为5 - width、height：尺寸设置（需1024维度） - seed：种子  ## 风格 | Styles ![PNSTYLE_2](https://github.com/ZHO-ZHO-ZHO/ComfyUI-PhotoMaker/assets/140084057/dc675478-47a0-456d-946b-0cf781aa4c28) ## 安装 | Install  - 手动安装： 1. `cd custom_nodes` 2. `git clone https://github.com/ZHO-ZHO-ZHO/ComfyUI-PhotoMaker.git` 3. `cd custom_nodes/ComfyUI-PhotoMaker` 4. `pip install -r requirements.txt` 5. 重启 ComfyUI ## 工作流 | Workflows - [V2.5 Disney-Character_PhotoMaker + DragNUWA](https://github.com/ZHO-ZHO-ZHO/ComfyUI-PhotoMaker/blob/main/PhotoMaker%20Workflows/Disney-Character_PhotoMaker%2BDragNUW%E3%80%90Zho%E3%80%91.json) 🆕 https://github.com/ZHO-ZHO-ZHO/ComfyUI-PhotoMaker/assets/140084057/ca2bfff4-701c-4960-ac11-b893f90c044c - [V2.5 lora + batch](https://github.com/ZHO-ZHO-ZHO/ComfyUI-PhotoMaker/blob/main/PhotoMaker%20Workflows/PhotoMaker_lora_batch%E3%80%90Zho%E3%80%91.json) 🆕 ![Dingtalk_20240119202403](https://github.com/ZHO-ZHO-ZHO/ComfyUI-PhotoMaker/assets/140084057/b862b89f-1609-43d9-84a1-5f11a2d1ab2d) - [V2.5 portraitmaster + styler + lora](https://github.com/ZHO-ZHO-ZHO/ComfyUI-PhotoMaker/blob/main/PhotoMaker%20Workflows/PhotoMaker_lora_portrait_styler%E3%80%90Zho%E3%80%91.json) 🆕 ![Dingtalk_20240119201125](https://github.com/ZHO-ZHO-ZHO/ComfyUI-PhotoMaker/assets/140084057/38e01035-139e-4a89-8982-6f7168684045) - [V2.5 本地模型 locally](https://github.com/ZHO-ZHO-ZHO/ComfyUI-PhotoMaker/blob/main/PhotoMaker%20Workflows/V2.5%20PhotoMaker_locally%E3%80%90Zho%E3%80%91.json) - [V2.5 自动下载 huggingface hub](https://github.com/ZHO-ZHO-ZHO/ComfyUI-PhotoMaker/blob/main/PhotoMaker%20Workflows/V2.5%20PhotoMaker_fromhub%E3%80%90Zho%E3%80%91.json) （注：特意补充了 V2.5 的基础工作流，V2.0 工作流已弃用）  ## 更新日志 - 20240122 新增 Disney-Character: PhotoMaker + DragNUWA 工作流，可配合 DragNUWA 融合多个人物特征生成迪士尼风格大片！ - 20240121 修复 bfloat16 bug - 20240120 补充两个基础工作流 - 20240119 更新为 V2.5：支持lora、支持自定义生成数量、支持通用提示词输入（文本）如：styler、portraitmater等新增 lora + batch、portraitmaster + styler + lora 两个工作流 - 20240119 更新为 V2.1：参考图改为直接输入/路径输入两种新模式，其中直接输入支持多图 ![Dingtalk_20240119022341](https://github.com/ZHO-ZHO-ZHO/ComfyUI-PhotoMaker/assets/140084057/78595f2c-7f87-477a-9896-007dd24fe8c9) - 20240118 更新为 V2.0：节点拆分 + 支持本地模型 + 支持自定义尺寸 +提速3倍新增本地、hub加载工作流 - 20240117 新增单张图输入，并给出对比图修复bug，初版上线 - 20240116 创建项目 ## 速度实测 | Speed - V2.0 提速 3 倍 - A100 50步 7s ![image](https://github.com/ZHO-ZHO-ZHO/ComfyUI-PhotoMaker/assets/140084057/4ae13ffc-c770-4551-bcb2-ce0b0ddc1367) - V1.5 - A100 50步 23s ![image](https://github.com/ZHO-ZHO-ZHO/ComfyUI-PhotoMaker/assets/140084057/df6eacda-2640-425b-b5ca-1ab5a8a61a66) - v100 50步 90s ![image](https://github.com/ZHO-ZHO-ZHO/ComfyUI-PhotoMaker/assets/140084057/973b8b6b-9195-4044-b75d-bd833bd6421e) ## Stars [![Star History Chart](https://api.star-history.com/svg?repos=ZHO-ZHO-ZHO/ComfyUI-PhotoMaker&type=Date)](https://star-history.com/#ZHO-ZHO-ZHO/ComfyUI-PhotoMaker&Date) ## 关于我 | About me 📬 **联系我**： - 邮箱：[email protected] - QQ 群：839821928 🔗 **社交媒体**： - 个人页：[-Zho-](https://jike.city/zho) - Bilibili：[我的B站主页](https://space.bilibili.com/484366804) - X（Twitter）：[我的Twitter](https://twitter.com/ZHOZHO672070) - 小红书：[我的小红书主页](https://www.xiaohongshu.com/user/profile/63f11530000000001001e0c8?xhsshare=CopyLink&appuid=63f11530000000001001e0c8&apptime=1690528872) 💡 **支持我**： - B站：[B站充电](https://space.bilibili.com/484366804) - 爱发电：[为我充电](https://afdian.net/a/ZHOZHO) ## Credits 感谢[@erLin](https://twitter.com/eviljer)对ComfyUI 的图像张量 Shape (N, H, W, C)的提醒，帮助我成功修复了bug！ [PhotoMaker](https://github.com/TencentARC/PhotoMaker)

AI & Machine Learning LLM Tools & Chat UIs

812 Github Stars

Open Source

ComfyUI-Gemini

<p align="center"><img src="https://github.com/ZHO-ZHO-ZHO/ComfyUI-Gemini/assets/140084057/ec15bc39-8111-417b-afc5-67cdfb3a9df5" alt="Gemini项目图" /></p>  <h1 align="center">Gemini in ComfyUI</h1>  🆕 最新 Gemini 1.5 Pro 模型已加入！ ![G1 5发](https://github.com/ZHO-ZHO-ZHO/ComfyUI-Gemini/assets/140084057/8c3105c7-e165-4b6b-8a46-6ed09ecfb388) - 支持系统指令设置（System Instruction） - 支持多模态 + 多轮对话 - 可以读取视频、音频等文件（上限 20G） - 支持输入的 token 上限达到了 104万8576 - 目前速率限制比较严，每分钟只有 2 次，每天只有 1000 次  ![Dingtalk_20240411194828](https://github.com/ZHO-ZHO-ZHO/ComfyUI-Gemini/assets/140084057/c1f71846-db5e-4bb1-a1b7-0b278ee95ea8) 已支持文件上传功能，不过还仅限于单个文件（图片、txt文件、pdf文件、音频mp3文件等），未来会支持多文件上传（用于读取视频）  All-in-One LoRa Training 预处理、自动打标、训练、测试 LoRA 一条龙工作流 https://github.com/ZHO-ZHO-ZHO/ComfyUI-Gemini/assets/140084057/d461f656-6888-48a8-b4f8-b70b7e46504d V2.0 聊天机器人节点 https://github.com/ZHO-ZHO-ZHO/ComfyUI-Gemini/assets/140084057/cb64ee29-a983-47fd-b26b-55386314afdd 将 Gemini pro vision 用于批量打标 https://github.com/ZHO-ZHO-ZHO/ComfyUI-Gemini/assets/140084057/00addc94-1587-40d6-83f4-6b47dc1db665 ## 项目介绍 | Info - 将 Google Gemini 引入到 ComfyUI 中，现在你可以用它为你生成提示词、描述图像，也可与它畅聊人生 - 目前 Gemini API 免费开放（20240502 开始计费），你可以在这里申请一个自己的 API Key：[Gemini API 申请](https://makersuite.google.com/app/apikey) - 最新 Gemini 1.5 Pro 模型： - 支持系统指令设置（System Instruction） - 支持多模态 + 多轮对话， - 可以读取视频、音频等文件（上限 20G） - 支持输入的 token 上限达到了 104万8576 - 目前速率限制比较严，每分钟只有 2 次，每天只有 1000 次 - 版本：V3.0 新增 Gemini 1.5 Pro 模型、系统指令、文件上传 ## 详细说明 | Features - Gemini 目前提供 3 种模型： - Gemini-pro: 文本模型 - Genimi-pro-vision: 文本 + 图像模型 - Gemini 1.5 Pro：文本 + 图像 + 文件（音频、视频等各类）模型 - Gemini 1.5 Pro 新节点： - 🆕Gemini_15P_Advance_Zho：支持系统指令设置（System Instruction） - 🆕Gemini_15P_Chat_Advance_Zho：支持系统指令设置（System Instruction）+ 多轮对话 - 📄Gemini_FileUpload_Zho：支持单文件上传（图片、音频、文本txt、pdf等），暂未支持视频（多文件）上传 - 📄Gemini_File_Zho：文件读取对话，最大 token 数为 1048576 - 2 类节点: - 隐式 API KEY：将 Gemini_API_Key 设置为了环境变量，更安全，方便分享工作流（不会外泄 API KEY）㊙️Gemini_Zho：同时支持 3 种模型，其中 Genimi-pro-vision 和 Gemini 1.5 Pro 可接受图像作为输入㊙️Gemini_Vsion_ImgURL_Zho：Genimi-pro-vision 和 Gemini 1.5 Pro 模型，接受图像链接作为输入㊙️Gemini_Chat_Zho：Genimi-pro 和 Gemini 1.5 Pro 模型，支持上下文对话，聊天机器人，Gemini 1.5 Pro 支持图像输入的上下文对话 - 显式API KEY：直接在节点中输入 Gemini_API_Key，仅供个人私密使用，请勿将包含 API KEY 的工作流分享出去 ✨Gemini_API_Zho：同时支持 3 种模型，其中 Genimi-pro-vision 和 Gemini 1.5 Pro 可接受图像作为输入 ✨Gemini_API_Vsion_ImgURL_Zho：Genimi-pro-vision 和 Gemini 1.5 Pro 模型，接受图像链接作为输入 ✨Gemini_API_Chat_Zho：：Genimi-pro 和 Gemini 1.5 Pro 模型，支持上下文对话，聊天机器人，Gemini 1.5 Pro 支持图像输入的上下文对话 - 辅助节点: - ✨DisplayText_Zho：显示文本 - ✨ConcatText_Zho：使用 “，” 连接文本 - 节点示例：  ![Dingtalk_20231220180446](https://github.com/ZHO-ZHO-ZHO/ComfyUI-Gemini/assets/140084057/3cba8d69-09bb-470c-940c-7f796c869d63) 聊天机器人 ![image](https://github.com/ZHO-ZHO-ZHO/ComfyUI-Gemini/assets/140084057/8a38f437-0148-4777-b872-e88995dd53d2) ## 参数说明 | Parameters - image（非必要）：选择 Gemini-pro 时无需接入图像，选择 Genimi-pro-vision 时需要接入图像，选择 Gemini 1.5 Pro 既可接入也可不接 - prompt：提示词 - model_name：模型选择，Gemini-pro 或 Genimi-pro-vision 或 Gemini 1.5 Pro - stream：流式传输响应 - api_key：输入 Gemini_API_Key （仅在显式节点上有） ## 使用方法 | How to use - 首先需要申请一个自己的 Gemini_API_Key：[Gemini API 申请](https://makersuite.google.com/app/apikey) - 选择隐式节点㊙️（推荐）：将你的 Gemini_API_Key 添加到 `config.json` 文件中，运行时会自动加载 - 选择显示节点✨：直接将 Gemini_API_Key 输入到节点的 api_key 中（注意：请勿将包含此节点的工作流分享出去，以免泄露你的 API Key） - 使用注意：本地使用请确保你可以有效连接到 Google Gemini 的服务，推荐使用 Colab 或 Kaggle（无连接问题） - 使用新版 Gemini 1.5 Pro 需要更新依赖 google-generativeai > 0.4.1 ## 安装 | Install - 推荐使用管理器 ComfyUI Manager 安装 - 手动安装： 1. `cd custom_nodes` 2. `git clone https://github.com/ZHO-ZHO-ZHO/ComfyUI-Gemini.git` 3. `cd custom_nodes/ComfyUI-Gemini` 4. `pip install -r requirements.txt` 5. 重启 ComfyUI ## 工作流 | Workflow ### V3.0 平替 DALL·3 [Gemini 1.5 Pro + Stable Diffusion + ComfyUI = DALL·3 ](https://github.com/ZHO-ZHO-ZHO/ComfyUI-Gemini/blob/main/Gemini_workflows/Gemini%201.5%20Pro%20%2B%20Stable%20Diffusion%20%2B%20ComfyUI%20%3D%20DALL%C2%B73%20%E3%80%90Zho%E3%80%91.json) ![Dingtalk_20240411195451](https://github.com/ZHO-ZHO-ZHO/ComfyUI-Gemini/assets/140084057/4e32d13d-615e-4441-931e-233c07cc958c) ### V2.0 工作流（隐式）（V1.1工作流依旧可用） [Gemini-pro Chatbot【Zho】](https://github.com/ZHO-ZHO-ZHO/ComfyUI-Gemini/blob/main/Gemini_workflows/Gemini-pro%20Chatbot%E3%80%90Zho%E3%80%91.json) ![image](https://github.com/ZHO-ZHO-ZHO/ComfyUI-Gemini/assets/140084057/8a38f437-0148-4777-b872-e88995dd53d2) ### V1.1 工作流（隐式） [Gemini-pro【Zho】](https://github.com/ZHO-ZHO-ZHO/ComfyUI-Gemini/blob/main/Gemini_workflows/Gemini-pro%E3%80%90Zho%E3%80%91.json) ![Dingtalk_20231220183708](https://github.com/ZHO-ZHO-ZHO/ComfyUI-Gemini/assets/140084057/7f0e222a-2de4-4c5b-883a-2172667d1d5b) [Genimi-pro-vision【Zho】](https://github.com/ZHO-ZHO-ZHO/ComfyUI-Gemini/blob/main/Gemini_workflows/Gemini-pro-vision%E3%80%90Zho%E3%80%91.json) ![Dingtalk_20231220192932](https://github.com/ZHO-ZHO-ZHO/ComfyUI-Gemini/assets/140084057/db4f4bf6-a0cf-42af-ac5a-7e2afd1bda93) ![Dingtalk_20231220190218](https://github.com/ZHO-ZHO-ZHO/ComfyUI-Gemini/assets/140084057/5bb57f7b-f00f-454a-9435-c1b8a02ae71a) ## 更新日志 | Changelog 20240411 - V3.0版：新增 Gemini 1.5 Pro 模型、系统指令、文件上传 - 新增平替 DALL·3 工作流（Gemini 1.5 Pro + Stable Diffusion + ComfyUI = DALL·3） 20231229 - V2.1版：修复 Deadline of 60.0s bug，方法来自官方：https://github.com/google/generative-ai-python/issues/117 20231222 - V2.0版：新增上下文聊天节点，相当于聊天机器人 - 💬Gemini_Chat_Zho（隐式） - 💬Gemini_API_Chat_Zho（显示） 20231221 - V1.1版：修改 API KEY 的加载方式为自动添加 config.json ，将 API KEY 写入即可 - 已登陆 manager 不用手动安装了 20231220 - 实现 Genimi-pro-vision 模型调用，支持图像或图像链接输入 - 增加隐式节点，更加安全 - 增加辅助节点 20231219 - 创建 ComfyUI Gemini 项目，实现 Gemini-pro 模型调用 ## Stars [![Star History Chart](https://api.star-history.com/svg?repos=ZHO-ZHO-ZHO/ComfyUI-Gemini&type=Timeline)](https://star-history.com/#ZHO-ZHO-ZHO/ComfyUI-Gemini&Timeline) ## 关于我 | About me 📬 **联系我**： - 邮箱：[email protected] - QQ 群：839821928 🔗 **社交媒体**： - 个人页：[-Zho-](https://jike.city/zho) - Bilibili：[我的B站主页](https://space.bilibili.com/484366804) - X（Twitter）：[我的Twitter](https://twitter.com/ZHOZHO672070) - 小红书：[我的小红书主页](https://www.xiaohongshu.com/user/profile/63f11530000000001001e0c8?xhsshare=CopyLink&appuid=63f11530000000001001e0c8&apptime=1690528872) 💡 **支持我**： - B站：[B站充电](https://space.bilibili.com/484366804) - 爱发电：[为我充电](https://afdian.net/a/ZHOZHO) ## Credits - DisplayText节点参考了：[ComfyUI_Custom_Nodes_AlekPet](https://github.com/AlekPet/ComfyUI_Custom_Nodes_AlekPet)，感谢 AlekPet ！

AI & Machine Learning LLM Tools & Chat UIs

788 Github Stars

Open Source

ComfyUI-BiRefNet-ZHO

![BRF](https://github.com/ZHO-ZHO-ZHO/ComfyUI-BiRefNet-ZHO/assets/140084057/ad43b70b-370b-49ca-94df-46039a224ce5) # ComfyUI-BiRefNet-ZHO Better version for [BiRefNet](https://github.com/zhengpeng7/birefnet) in ComfyUI | Both img & video ![Dingtalk_20240401154248](https://github.com/ZHO-ZHO-ZHO/ComfyUI-BiRefNet-ZHO/assets/140084057/1d72a017-5dc5-482e-a0c7-284d14b807b7) ## 项目介绍 | Info - 对 [BiRefNet](https://github.com/zhengpeng7/birefnet) 的非官方实现 - 与 [viperyl/ComfyUI-BiRefNet](https://github.com/viperyl/ComfyUI-BiRefNet) 插件区别： - 原版插件：只能简单输出蒙版，不方便用，也不能处理视频 - 新版插件： 1）模型加载和图像处理相分离，提升速度（和我之前做的 [BRIA RMBG in ComfyUI](https://github.com/ZHO-ZHO-ZHO/ComfyUI-BRIA_AI-RMBG) 插件一致） 2）可以直接输出透明背景的 PNG 图 3）可以直接抠视频 - BiRefNet 模型：目前最好的开源可商用背景抠除模型 - 版本：**V1.0** 同时支持图像和视频处理 ## 视频演示 https://github.com/ZHO-ZHO-ZHO/ComfyUI-BiRefNet-ZHO/assets/140084057/ae337aa0-ec9e-40c3-85d4-390654bd0cc7 ## 安装 | Install - 所需依赖：timm，如已安装无需运行 requirements.txt，只需 git 项目即可 - 推荐使用管理器 ComfyUI Manager 安装（On the Way） - 手动安装： 1. `cd custom_nodes` 2. `git clone https://github.com/ZHO-ZHO-ZHO/ComfyUI-BiRefNet-ZHO.git` 3. `cd custom_nodes/ComfyUI-BiRefNet-ZHO` 4. `pip install -r requirements.txt` 5. 重启 ComfyUI ## 使用说明 | How to Use - 将 [BiRefNet](https://huggingface.co/ViperYX/BiRefNet) 中的 6 个模型均下载至`./models/BiRefNet` - 节点： ![Dingtalk_20240331031811](https://github.com/ZHO-ZHO-ZHO/ComfyUI-BiRefNet-ZHO/assets/140084057/ce37a57e-c5d0-4f17-9a87-457dd1022776) - 🧹BiRefNet Model Loader：自动加载 BiRefNet 模型 - 🧹BiRefNet：去除背景 ## 更新日志 - 20240401 V1.0 同时支持图像和视频处理（支持批量处理）、支持输出 mask 功能创建项目 ## Stars [![Star History Chart](https://api.star-history.com/svg?repos=ZHO-ZHO-ZHO/ComfyUI-BiRefNet-ZHO&type=Date)](https://star-history.com/#ZHO-ZHO-ZHO/ComfyUI-BiRefNet-ZHO&Date) ## 关于我 | About me 📬 **联系我**： - 邮箱：[email protected] - QQ 群：839821928 🔗 **社交媒体**： - 个人页：[-Zho-](https://jike.city/zho) - Bilibili：[我的B站主页](https://space.bilibili.com/484366804) - X（Twitter）：[我的Twitter](https://twitter.com/ZHOZHO672070) - 小红书：[我的小红书主页](https://www.xiaohongshu.com/user/profile/63f11530000000001001e0c8?xhsshare=CopyLink&appuid=63f11530000000001001e0c8&apptime=1690528872) 💡 **支持我**： - B站：[B站充电](https://space.bilibili.com/484366804) - 爱发电：[为我充电](https://afdian.net/a/ZHOZHO) ## Credits [BiRefNet](https://github.com/zhengpeng7/birefnet) 代码参考了 [viperyl/ComfyUI-BiRefNet](https://github.com/viperyl/ComfyUI-BiRefNet) 感谢！

AI & Machine Learning LLM Tools & Chat UIs

405 Github Stars

Open Source

ComfyUI-APISR

![APISR](https://github.com/ZHO-ZHO-ZHO/ComfyUI-APISR/assets/140084057/740def33-964b-47ab-a2f0-e3a11fcbbd1c) # APISR IN COMFYUI Unofficial implementation of [APISR](https://github.com/Kiteretsu77/APISR) for ComfyUI https://github.com/ZHO-ZHO-ZHO/ComfyUI-APISR/assets/140084057/e6deb435-d276-4726-9d6d-457cc99d433e ## 项目介绍 | Info - 对 [APISR](https://github.com/Kiteretsu77/APISR) 的非官方实现 - APISR：专门用于动漫的超分模型，包含 2x 和 4x 双模型，速度飞快，效果很好 - 版本：V1.1 新增支持 cpu 生成，同时支持图像和视频放大（视频分为Batch和Lterative两种方式，分别适用于高/低显存） ![Dingtalk_20240319200511](https://github.com/ZHO-ZHO-ZHO/ComfyUI-APISR/assets/140084057/a6aaccf9-01e6-4c79-a9bf-6beb830e572a) ## 节点说明 | Features - APISR 模型加载 | 🔎APISR ModelLoader - 支持 2 种官方模型：[2x_APISR_RRDB_GAN_generator](https://huggingface.co/camenduru/APISR/resolve/main/2x_APISR_RRDB_GAN_generator.pth?download=true) 和 [4x_APISR_GRL_GAN_generator](https://huggingface.co/camenduru/APISR/resolve/main/4x_APISR_GRL_GAN_generator.pth?download=true)，需手动下载放入 `/ComfyUI/models/apisr` 中 - 放大（批） | 🔎APISR - 同时支持图像和视频 - 批处理，帧数多的视频需要高显存 - 放大（逐张）| 🔎APISR Lterative - 同时支持图像和视频 - 逐张处理，低显存推荐使用 - 若输出配合 Video Helper Suite 插件使用，则需要使用 ComfyUI 自带的 Split Image with Alpha 节点去除 Alpha 通道 ![Dingtalk_20240319202553](https://github.com/ZHO-ZHO-ZHO/ComfyUI-APISR/assets/140084057/5cc2c2fc-dc09-44e2-a363-831910f77172) ## 安装 | Install - 推荐使用管理器 ComfyUI Manager 安装 - 手动安装： 1. `cd custom_nodes` 2. `git clone https://github.com/ZHO-ZHO-ZHO/ComfyUI-APISR` 3. `cd custom_nodes/ComfyUI-APISR` 4. `pip install -r requirements.txt` 5. 重启 ComfyUI ## 工作流 | Workflows V1.0 - [V1.0 APISR img or video Batch](https://github.com/ZHO-ZHO-ZHO/ComfyUI-APISR/blob/main/APISR%20WORKFLOWS/APISR%20img%20or%20video%20Batch%E3%80%90Zho%E3%80%91.json) ![Dingtalk_20240319195936](https://github.com/ZHO-ZHO-ZHO/ComfyUI-APISR/assets/140084057/2dc21ac0-6ca4-45a6-8009-29f0eece7426) - [V1.0 APISR img or video Lterative](https://github.com/ZHO-ZHO-ZHO/ComfyUI-APISR/blob/main/APISR%20WORKFLOWS/APISR%20img%20or%20video%20Lterative%E3%80%90Zho%E3%80%91.json) ![Dingtalk_20240319203321](https://github.com/ZHO-ZHO-ZHO/ComfyUI-APISR/assets/140084057/9ebc1153-2d68-4fa2-b24a-b7bb8ebe437a) ## 更新日志 - 20240319 V1.1 新增支持 cpu 生成（from：Fihade） - 20240319 V1.0 同时支持图像与视频的放大，还提供分别适合于高显存和低显存的 Batch 和 Lterative 两种模式创建项目 ## Stars [![Star History Chart](https://api.star-history.com/svg?repos=ZHO-ZHO-ZHO/ComfyUI-APISR&type=Date)](https://star-history.com/#ZHO-ZHO-ZHO/ComfyUI-APISR&Date) ## 关于我 | About me 📬 **联系我**： - 邮箱：[email protected] - QQ 群：839821928 🔗 **社交媒体**： - 个人页：[-Zho-](https://jike.city/zho) - Bilibili：[我的B站主页](https://space.bilibili.com/484366804) - X（Twitter）：[我的Twitter](https://twitter.com/ZHOZHO672070) - 小红书：[我的小红书主页](https://www.xiaohongshu.com/user/profile/63f11530000000001001e0c8?xhsshare=CopyLink&appuid=63f11530000000001001e0c8&apptime=1690528872) 💡 **支持我**： - B站：[B站充电](https://space.bilibili.com/484366804) - 爱发电：[为我充电](https://afdian.net/a/ZHOZHO) ## Credits [APISR](https://github.com/Kiteretsu77/APISR)

AI Tools ML Frameworks

387 Github Stars

Open Source

ComfyUI-StableDiffusion3-API

![ComfyUI_temp_xcgvh_00014_](https://github.com/ZHO-ZHO-ZHO/ComfyUI-StableDiffusion3-API/assets/140084057/39a8e52b-0df3-462a-b2e5-2bcae481f8ea) # ComfyUI Stable Diffusion 3 API  ![Dingtalk_20240418031631](https://github.com/ZHO-ZHO-ZHO/ComfyUI-StableDiffusion3-API/assets/140084057/3c3f9b8a-c219-47c4-88d3-b52cece676da) ![Dingtalk_20240418031914](https://github.com/ZHO-ZHO-ZHO/ComfyUI-StableDiffusion3-API/assets/140084057/8b85f15c-b3f6-421a-831e-e05fccc097a6) SD3 TUrbo & Img2Img ![Dingtalk_20240418160608](https://github.com/ZHO-ZHO-ZHO/ComfyUI-StableDiffusion3-API/assets/140084057/6c030471-5570-42d9-9931-c1455d7313b2) ## 项目介绍 | Info - 通过 API 将 Stable Diffusion 3 引入 ComfyUI - Stable Diffusion 3：目前通过 API 开放，[详情](https://stability.ai/news/stable-diffusion-3-api?utm_source=twitter&utm_medium=website&utm_campaign=blog)，API申请：[Stability API key](https://platform.stability.ai/account/keys)，每个账户提供 25 个免费积分 - 模型： - SD3 6.5 积分/张 - SD3 Turbo 4 积分/张 - 版本：V1.5 支持多种比例的 SD3 文生图，修复图生图和 Turbo 的 bug，可以正常使用了 ![Dingtalk_20240418004328](https://github.com/ZHO-ZHO-ZHO/ComfyUI-StableDiffusion3-API/assets/140084057/89952a6b-b46b-419b-a429-2771621998a7) ## 节点说明 | Features - 🔥Stable Diffusion 3 - positive：正向提示词 - negative：负向提示词（Turbo 模型不支持） - aspect_ratio：画面比例，共 9 种："21:9", "16:9", "5:4", "3:2", "1:1", "2:3", "4:5", "9:16", "9:21"（图生图不适用） - mode：文生图或图生图 - model：SD3 或 SD3 Turbo - seed：种子 - image：非必要，仅用于图生图 - strength：非必要，仅用于图生图 ![Dingtalk_20240418025505](https://github.com/ZHO-ZHO-ZHO/ComfyUI-StableDiffusion3-API/assets/140084057/47b122f9-3864-47fe-ae34-f2a25275a701) ## 安装 | Install - 使用前请先申请 API ：[Stability API key](https://platform.stability.ai/account/keys)，每个账户提供 25 个免费积分 - 将 Stability API key 添加到 config.json 文件中，运行时会自动加载 - 推荐使用管理器 ComfyUI Manager 安装（On The Way） - 手动安装： 1. `cd custom_nodes` 2. `git clone https://github.com/ZHO-ZHO-ZHO/ComfyUI-StableDiffusion3-API` 3. 重启 ComfyUI ## 工作流 | Workflows V1.0 - [V1.0 SD3 API]() ![Dingtalk_20240418030657](https://github.com/ZHO-ZHO-ZHO/ComfyUI-StableDiffusion3-API/assets/140084057/c8c1f265-7f59-430d-ac00-0713019cbe44)  ## 更新日志 - 202418 V1.5 20240418 修复图生图和 Turbo 的 bug，可以正常使用了 V1.0 支持多种比例的 SD3 文生图（SD3 Turbo 和图生图功能还有问题）创建项目 ## Stars [![Star History Chart](https://api.star-history.com/svg?repos=ZHO-ZHO-ZHO/ComfyUI-StableDiffusion3-API&type=Date)](https://star-history.com/#ZHO-ZHO-ZHO/ComfyUI-StableDiffusion3-API&Date) ## 关于我 | About me 📬 **联系我**： - 邮箱：[email protected] - QQ 群：839821928 🔗 **社交媒体**： - 个人页：[-Zho-](https://jike.city/zho) - Bilibili：[我的B站主页](https://space.bilibili.com/484366804) - X（Twitter）：[我的Twitter](https://twitter.com/ZHOZHO672070) - 小红书：[我的小红书主页](https://www.xiaohongshu.com/user/profile/63f11530000000001001e0c8?xhsshare=CopyLink&appuid=63f11530000000001001e0c8&apptime=1690528872) 💡 **支持我**： - B站：[B站充电](https://space.bilibili.com/484366804) - 爱发电：[为我充电](https://afdian.net/a/ZHOZHO) ## Credits [Stable Diffusion 3](https://stability.ai/news/stable-diffusion-3-api?utm_source=twitter&utm_medium=website&utm_campaign=blog)

WordPress Themes & Plugins LLM Tools & Chat UIs

262 Github Stars

Open Source

ComfyUI-Qwen-VL-API

![qwenvl](https://github.com/ZHO-ZHO-ZHO/ComfyUI-Qwen-VL-API/assets/140084057/db84bdde-e2be-48fa-8ce5-cdd5ee2dd057) <h1 align="center">QWen-VL in ComfyUI</h1> ![Dingtalk_20240130191521](https://github.com/ZHO-ZHO-ZHO/ComfyUI-Qwen-VL-API/assets/140084057/5d10adba-90a6-48e0-94de-33d10b5d32f9) ## 项目介绍 | Info - 将阿里 [QWen-VL](https://github.com/QwenLM/Qwen-VL) 双模型（Plus & Max）通过 API 调用引入到 ComfyUI 中，初测下来 QWen-VL 是目前开源世界最好的视觉模型 - 目前 QWen-VL API 免费开放（🆕刚收到阿里的通知：3.18开始正式收费！收费标准见下图），你可以在这里申请一个自己的 API Key：[QWen-VL API 申请](https://help.aliyun.com/zh/dashscope/developer-reference/activate-dashscope-and-create-an-api-key) ![image](https://github.com/ZHO-ZHO-ZHO/ComfyUI-Qwen-VL-API/assets/140084057/8e28aa5a-637b-43b6-b43b-a98da7d01779) - 版本：V1.0 支持单/多轮对话双模式、支持读取本地图像 ## 视频演示 https://github.com/ZHO-ZHO-ZHO/ComfyUI-Qwen-VL-API/assets/140084057/430d4ea1-6484-46e2-85bf-ad7cf95dda95 ## 详细说明 | Features - QWen-VL 目前提供 2 种模型：  | 模型 | 说明 | |--------------|---------------------------| | QWen-VL-Plus | 通义千问大规模视觉语言模型增强版。大幅提升细节识别能力和文字识别能力，支持超百万像素分辨率和任意长宽比规格的图像。在广泛的视觉任务上提供卓越的性能。| | QWen-VL-Max | 通义千问超大规模视觉语言模型。相比增强版，再次提升视觉推理能力和指令遵循能力，提供更高的视觉感知和认知水平。在更多复杂任务上提供最佳的性能。 | - 节点（均采用隐式 API KEY）: - ㊙️QWenVL_Zho：同时支持两种模型，接受本地图像作为输入（图像仅临时储存用完会自动清除） - ㊙️QWenVL_Chat_Zho：同时支持两种模型，支持上下文窗口，接受本地图像作为输入（图像储存在 /custom nodes/ComfyUI-Qwen-VL-API/qw 文件夹中，可手动清理） - 节点示例 & 与 Gemini-Pro-Vision 对比（描述更准更详细，且支持上下文多轮对话，Gemini-Pro-Vision 仅支持单轮对话）： ![Dingtalk_20240130191546](https://github.com/ZHO-ZHO-ZHO/ComfyUI-Qwen-VL-API/assets/140084057/0806782a-341b-4d53-b482-04f515db2bc3) ![Dingtalk_20240130133911](https://github.com/ZHO-ZHO-ZHO/ComfyUI-Qwen-VL-API/assets/140084057/1c62f1ae-2832-4b07-9f04-0b835f8c7a8f) - 上下文多轮对话： ![Dingtalk_20240130174301](https://github.com/ZHO-ZHO-ZHO/ComfyUI-Qwen-VL-API/assets/140084057/e63c1ea1-c96a-4550-8c40-7af54a2f4262) ## 参数说明 | Parameters - image：接入本地图像 - prompt：提示词 - model_name：模型选择，QWen-VL-Plus 或 QWen-VL-Max - seed：随机种子 ## 使用方法 | How to use - 首先需要申请一个自己的 QWen-VL_API_Key：[QWen-VL API 申请](https://help.aliyun.com/zh/dashscope/developer-reference/activate-dashscope-and-create-an-api-key) - 将你的 QWen-VL_API_Key 添加到 `config.json` 文件中，运行时会自动加载 - 输出节点可配合像[ComfyUI-Gemini](https://github.com/ZHO-ZHO-ZHO/ComfyUI-Gemini)中 ✨DisplayText_Zho 一样的任何接受文本的节点 ## 安装 | Install - 推荐使用管理器 ComfyUI Manager 安装（ON THE WAY） - 手动安装： 1. `cd custom_nodes` 2. `https://github.com/ZHO-ZHO-ZHO/ComfyUI-Qwen-VL-API.git` 3. `cd custom_nodes/ComfyUI-Qwen-VL-API` 4. `pip install -r requirements.txt` 5. 重启 ComfyUI ## 工作流 | Workflow ### V1.0 工作流 [Qwen-VL V1.0【Zho】](https://github.com/ZHO-ZHO-ZHO/ComfyUI-Qwen-VL-API/blob/main/QWEN-VL%20WORKFLOWS/Qwen-VL%20V1.0%E3%80%90Zho%E3%80%91.json) ![Dingtalk_20240130200115](https://github.com/ZHO-ZHO-ZHO/ComfyUI-Qwen-VL-API/assets/140084057/bae0447b-a4bf-45a3-ad27-c496a02bd6d2) ## 更新日志 | Changelog 20240130 - V1.0版：支持单/多轮对话双模式、支持读取本地图像 - 创建项目 ## Stars [![Star History Chart](https://api.star-history.com/svg?repos=ZHO-ZHO-ZHO/ComfyUI-Qwen-VL-API&type=Timeline)](https://star-history.com/#ZHO-ZHO-ZHO/ComfyUI-Qwen-VL-API&Timeline) ## 关于我 | About me 📬 **联系我**： - 邮箱：[email protected] - QQ 群：839821928 🔗 **社交媒体**： - 个人页：[-Zho-](https://jike.city/zho) - Bilibili：[我的B站主页](https://space.bilibili.com/484366804) - X（Twitter）：[我的Twitter](https://twitter.com/ZHOZHO672070) - 小红书：[我的小红书主页](https://www.xiaohongshu.com/user/profile/63f11530000000001001e0c8?xhsshare=CopyLink&appuid=63f11530000000001001e0c8&apptime=1690528872) 💡 **支持我**： - B站：[B站充电](https://space.bilibili.com/484366804) - 爱发电：[为我充电](https://afdian.net/a/ZHOZHO) ## Credits [QWen-VL](https://github.com/QwenLM/Qwen-VL)

AI Tools LLM Tools & Chat UIs

219 Github Stars

Open Source

ComfyUI-Assistant-GPTs

GPTs for ComfyUI | Explanation Creation Resolution | ComfyUI 助手

AI Tools Live Chat & Chatbots

148 Github Stars

Software by zho-zho-zho