zhijun.li
commited on
Commit
·
850ce03
1
Parent(s):
432f4e8
Adjusting annotations
Browse files
app.py
CHANGED
|
@@ -166,7 +166,7 @@ with gr.Blocks(title="Ernie 4.5 Video2Code", theme=gr.themes.Soft()) as demo:
|
|
| 166 |
# 修复2:将 open 设置为 True,默认展开
|
| 167 |
with gr.Accordion("📚 Technical Capabilities of ERNIE 4.5-VL", open=True):
|
| 168 |
gr.Markdown("""
|
| 169 |
-
This application is powered by **Baidu ERNIE 4.5
|
| 170 |
|
| 171 |
* **👁️ Multimodal Heterogeneous MoE**: Uses dedicated vision experts to process images and video frames without interfering with text generation capabilities.
|
| 172 |
* **⏳ 3D-RoPE Temporal Modeling**: Incorporates 3D Rotary Position Embeddings to independently encode temporal, width, and height information.
|
|
|
|
| 166 |
# 修复2:将 open 设置为 True,默认展开
|
| 167 |
with gr.Accordion("📚 Technical Capabilities of ERNIE 4.5-VL", open=True):
|
| 168 |
gr.Markdown("""
|
| 169 |
+
This application is powered by **Baidu ERNIE 4.5**, a state-of-the-art foundation model with specific enhancements for video understanding:
|
| 170 |
|
| 171 |
* **👁️ Multimodal Heterogeneous MoE**: Uses dedicated vision experts to process images and video frames without interfering with text generation capabilities.
|
| 172 |
* **⏳ 3D-RoPE Temporal Modeling**: Incorporates 3D Rotary Position Embeddings to independently encode temporal, width, and height information.
|