update readme
Browse files
README.md
CHANGED
|
@@ -43,7 +43,7 @@ model-index:
|
|
| 43 |
|
| 44 |
A compact RoBERTa-style **Masked Language Model (MLM)** for Persian (Farsi).
|
| 45 |
We trained a Persian BPE tokenizer on a mixed corpus combining formal text with social-media and chat data.
|
| 46 |
-
The model is pre-trained with
|
| 47 |
|
| 48 |
- **NER** on a **merged ARMAN + PEYMA** corpus
|
| 49 |
- **Relation Extraction** on **PERLEX**
|
|
|
|
| 43 |
|
| 44 |
A compact RoBERTa-style **Masked Language Model (MLM)** for Persian (Farsi).
|
| 45 |
We trained a Persian BPE tokenizer on a mixed corpus combining formal text with social-media and chat data.
|
| 46 |
+
The model is pre-trained with this tokenizer, optimized for Persian script and evaluated on two downstream tasks:
|
| 47 |
|
| 48 |
- **NER** on a **merged ARMAN + PEYMA** corpus
|
| 49 |
- **Relation Extraction** on **PERLEX**
|