Steel OCR DET Model

鉄骨の手書き製品コードを検出する PP-OCRv5_mobile_det ファインチューニング済みモデル。パイプライン: 画像 → DET(本モデル) → crop → REC → 候補マッチング。

最新版: `augmented_v1-cleaned-20260602`(現 main)

学習 DET ラベルから 非コードの背景注釈(中国語看板・断片)を除去したクリーンデータ で augmented_v1 レシピを再学習したもの。背景文字を検出ターゲットに含めると過検出を誘発するため除去。

固定 test(14枚, 960px)	値
Precision	0.816
Recall	0.738
H-mean	0.775(旧 augmented_v1 = 0.727、+4.8pt)

clean val(コード入り70枚): P 0.745 / R 0.837 / H-mean 0.788(best_epoch 134)。

データ品質修正

steel-ocr-dataset-960 の DET ラベルに背景の中国語・断片が「正例」box として混入していた (train 188 / val 101 非コード box)。コード box のみ残してクリーン化(原寸クリーンデータと bbox 一致を確認)。

Configuration(augmented_v1)

Parameter	Value
Base Model	PP-OCRv5_mobile_det
Epochs	200
Batch Size	48
Learning Rate	0.001
Warmup	2
Weight Decay	5e-5
OHEM ratio	3
Augmentation	rotate ±20 + shear ±10 + blur/noise/brightness-contrast

ファイル

det_model.onnx — ONNX(モバイル / ONNX パイプライン用・推奨)
inference/ — PaddleOCR 3.x inference 形式(inference.json / .pdiparams / inference.yml)
best_accuracy.pdparams — 学習チェックポイント

Usage

from huggingface_hub import hf_hub_download
onnx = hf_hub_download("iput-tk230215/steel-ocr-det", "det_model.onnx",
                       revision="augmented_v1-cleaned-20260602")

Versions

タグでバージョン管理。旧版(汚染ラベル学習)は augmented_v1-20260119 等のタグから取得可。 Tags 参照。

Repository

https://github.com/iput-tk230215/solution_teamB_image-recog

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support