-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
the text2video result of 1.1 is worse than 1.0 #332
Comments
This issue is stale because it has been open for 7 days with no activity. |
We identify the issue. The problem is that we apply qk-norm before applying position encoding. Thus, when we generate longer video (more than 2s), the image quality deteriorates. We will fix this in the next release. |
This issue is stale because it has been open for 7 days with no activity. |
这个修复了吗 |
This issue is stale because it has been open for 7 days with no activity. |
This issue was closed because it has been inactive for 7 days since being marked as stale. |
【1.1 version】
I use the gradio/app.py
prompt: # prompt_text = "A bustling city street at night, filled with the glow of car headlights and the ambient light of streetlights. The scene is a blur of motion, with cars speeding by and pedestrians navigating the crosswalks. The cityscape is a mix of towering buildings and illuminated signs, creating a vibrant and dynamic atmosphere. The perspective of the video is from a high angle, providing a bird's eye view of the street and its surroundings. The overall style of the video is dynamic and energetic, capturing the essence of urban life at night."
I use the same prompt ,with version 1.0 , I find 1.1 is worse than 1.0
The text was updated successfully, but these errors were encountered: