PaddlePaddle/PaddleOCR-VL · To rename for future compatibility with transformers

#71

by xiaohei66 - opened 28 days ago

base: refs/heads/main ← from: refs/pr/71

+36 -63

xiaohei66 (PaddlePaddle org) · 28 days ago · edited 28 days ago

Our vision encoder is a heavily modified version of SigLIP: it uses a dynamic resolution mechanism and 2D RoPE in place of the original’s fixed resolution and learnable absolute position embeddings.
As a result, our implementation is fundamentally different from the standard SigLIP in libraries like Transformers. To avoid future naming conflicts and confusion, we must move away from the Siglip* name.
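
For readers unfamiliar with the difference, here is a minimal, generic sketch of 2D RoPE in plain Python. It is not the PaddleOCR-VL implementation: the function names (`rope_2d_angles`, `apply_rope`, `score`) are hypothetical, and the layout (first half of the rotary pairs encodes the row, second half the column) is one common convention assumed for illustration. The point it shows is that positions are computed from the patch grid coordinates rather than looked up in a learned table of fixed size, so any grid shape works.

```python
import math


def rope_2d_angles(row, col, head_dim, base=10000.0):
    """Rotation angles for one patch at grid position (row, col).

    Assumed layout: the first half of the rotary pairs encodes the row
    index, the second half encodes the column index.
    """
    if head_dim % 4 != 0:
        raise ValueError("head_dim must be divisible by 4")
    pairs_per_axis = head_dim // 4
    freqs = [base ** (-i / pairs_per_axis) for i in range(pairs_per_axis)]
    return [row * f for f in freqs] + [col * f for f in freqs]


def apply_rope(vec, angles):
    """Rotate each consecutive (even, odd) pair of vec by its angle."""
    out = []
    for i, theta in enumerate(angles):
        x, y = vec[2 * i], vec[2 * i + 1]
        c, s = math.cos(theta), math.sin(theta)
        out += [x * c - y * s, x * s + y * c]
    return out


def score(q, k, q_pos, k_pos):
    """Attention logit between a query patch and a key patch after 2D RoPE."""
    head_dim = len(q)
    q_rot = apply_rope(q, rope_2d_angles(q_pos[0], q_pos[1], head_dim))
    k_rot = apply_rope(k, rope_2d_angles(k_pos[0], k_pos[1], head_dim))
    return sum(a * b for a, b in zip(q_rot, k_rot))


q = [0.3, -1.2, 0.7, 0.1, -0.5, 0.9, 1.1, -0.4]
k = [1.0, 0.2, -0.6, 0.8, 0.4, -0.3, 0.5, 0.6]

# Same relative offset (2 rows, 3 columns) at two different absolute
# positions: the two scores agree up to floating-point error.
near = score(q, k, (1, 2), (3, 5))
far = score(q, k, (11, 22), (13, 25))
```

Because the score depends only on the row and column offset between two patches, nothing in this scheme is tied to a particular image size, which is what separates it from a fixed-length table of learnable absolute position embeddings.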

xiaohei66 changed pull request title from "rename" to "To rename for future compatibility with transformers" 28 days ago

renmaed 4027c07

xiaohei66 changed pull request status to open 28 days ago

xiaohei66 changed pull request status to merged 28 days ago
