
Suggested fixes for the JanusFlow example code #77

Open
songafu opened this issue Jan 28, 2025 · 3 comments
Comments

@songafu

songafu commented Jan 28, 2025

While using JanusFlow, I found that the text-to-image part of the example code fails to run on the latest transformers versions (>= 4.48.0), and the example code also appears to contain a defect. Suggested fixes:

(1) In the JanusFlow text-to-image example, the data-flow handling has a variable-reference bug:
```python
if step == 0:
    outputs = vl_gpt.language_model.model(inputs_embeds=llm_emb,
                                          use_cache=True,
                                          attention_mask=attention_mask,
                                          past_key_values=None)
    past_key_values = []
    for kv_cache in outputs.past_key_values:  # should be outputs.past_key_values
        k, v = kv_cache[0], kv_cache[1]
        past_key_values.append((k[:, :, :inputs_embeds.shape[1], :],
                                v[:, :, :inputs_embeds.shape[1], :]))
    past_key_values = tuple(past_key_values)
```
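The trimming step above can be sketched with plain nested lists standing in for torch tensors of shape (batch, heads, seq_len, head_dim). This is purely illustrative (the helper name and dummy values are hypothetical), to show what slicing each layer's cache back to the prompt length does:

```python
# Illustrative sketch: trim each layer's (k, v) cache to the prompt length.
# Plain nested lists stand in for torch tensors; not the actual JanusFlow code.

def trim_kv_cache(layer_caches, prompt_len):
    """Keep only the first `prompt_len` positions along the seq axis."""
    trimmed = []
    for k, v in layer_caches:
        # equivalent of k[:, :, :prompt_len, :] on a real tensor
        k_trim = [[head[:prompt_len] for head in batch] for batch in k]
        v_trim = [[head[:prompt_len] for head in batch] for batch in v]
        trimmed.append((k_trim, v_trim))
    return tuple(trimmed)

# one layer, batch=1, heads=1, seq_len=4, head_dim=2
k = [[[[0.0, 0.1], [1.0, 1.1], [2.0, 2.1], [3.0, 3.1]]]]
v = [[[[9.0, 9.1], [8.0, 8.1], [7.0, 7.1], [6.0, 6.1]]]]
cache = trim_kv_cache([(k, v)], prompt_len=2)
print(len(cache[0][0][0][0]))  # seq axis trimmed from 4 to 2
```

The key point is that the loop must read from `outputs.past_key_values` (the cache the model just produced), not from the freshly emptied `past_key_values` list.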

(2) On the latest transformers versions (>= 4.48.0), the JanusFlow service fails to run with the error below. I suggest either advising users in the Quick Start to use an older transformers version (e.g. 4.38.2), or fixing the code to be compatible with the latest transformers.

```
llama/modeling_llama.py", line 551, in forward
    past_seen_tokens = past_key_values.get_seq_length() if past_key_values is not None else 0
                       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
AttributeError: 'tuple' object has no attribute 'get_seq_length'
```
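The root cause of this traceback is an interface mismatch: newer transformers expects `past_key_values` to be a `Cache` object exposing `get_seq_length()`, while the example passes a plain tuple in the legacy per-layer `(k, v)` format. A minimal pure-Python sketch of the mismatch (dummy values, no transformers dependency):

```python
# Legacy KV-cache format: a tuple of per-layer (key, value) pairs.
legacy_cache = (("k_layer0", "v_layer0"), ("k_layer1", "v_layer1"))

# transformers >= 4.48 calls past_key_values.get_seq_length() in
# modeling_llama.py, which a plain tuple does not provide:
print(hasattr(legacy_cache, "get_seq_length"))  # -> False
```

One possible compatibility route, assuming a transformers version that ships `transformers.DynamicCache` (it provides `from_legacy_cache()` and `get_seq_length()`), is to wrap the tuple via `DynamicCache.from_legacy_cache(past_key_values)` before passing it back into the model; otherwise, pinning transformers to 4.38.2 as suggested above sidesteps the issue.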

@SimonYS001

Good job!

@scifisatan

scifisatan commented Feb 4, 2025

Just created a fix for this: #137

Replace the code on lines 108-122:

```python
if step == 0:
    outputs = vl_gpt.language_model.model(inputs_embeds=llm_emb,
                                          use_cache=True,
                                          attention_mask=attention_mask,
                                          past_key_values=None)
    past_key_values = []
    for kv_cache in past_key_values:
        k, v = kv_cache[0], kv_cache[1]
        past_key_values.append((k[:, :, :inputs_embeds.shape[1], :],
                                v[:, :, :inputs_embeds.shape[1], :]))
    past_key_values = tuple(past_key_values)
else:
    outputs = vl_gpt.language_model.model(inputs_embeds=llm_emb,
                                          use_cache=True,
                                          attention_mask=attention_mask,
                                          past_key_values=past_key_values)
```

with this:

```python
if step == 0:
    past_key_values = None  # Ensure it starts as None
else:
    past_key_values = tuple(past_key_values) if past_key_values else None  # Convert only if it's valid

outputs = vl_gpt.language_model.model(
    inputs_embeds=llm_emb,
    use_cache=True,
    attention_mask=attention_mask,
    past_key_values=past_key_values  # Now correctly assigned
)
```

Hope it helps :)
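The replacement above collapses the two branches into a single model call, carrying the cache forward between steps. A stand-alone sketch of that control flow, with a dummy stand-in for `vl_gpt.language_model.model` (no torch; the fake model just appends one entry per call):

```python
# Dummy stand-in for the model call: accepts a cache (or None) and returns a
# dict whose "past_key_values" has grown by one layer-entry. Illustrative only.
def fake_model(inputs_embeds, use_cache, attention_mask, past_key_values):
    new_cache = (past_key_values or ()) + (("kv",),)
    return {"past_key_values": new_cache}

past_key_values = None
for step in range(3):
    if step == 0:
        past_key_values = None  # start fresh on the first step
    else:
        past_key_values = tuple(past_key_values) if past_key_values else None
    outputs = fake_model(inputs_embeds=None, use_cache=True,
                         attention_mask=None, past_key_values=past_key_values)
    past_key_values = outputs["past_key_values"]

print(len(past_key_values))  # -> 3, cache grew once per step
```

Note this variant drops the original trimming of the cache to the prompt length, so it changes behavior rather than only fixing the loop variable; whether that matters for generation quality is worth checking.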

@nv-samcheng

Just want to confirm: is the KV cache used in JanusFlow?
