入力劣化に対する量子化のロバスト性：物体検出のケーススタディ

要旨

ポストトレーニング量子化（PTQ）は、YOLOのような効率的な物体検出モデルをリソース制約のあるデバイスに展開する上で重要です。しかし、精度低下がノイズ、ぼかし、圧縮アーティファクトといった実世界の入力劣化に対するモデルの頑健性に与える影響は大きな懸念事項です。本論文では、YOLOモデル（ナノからエクストララージスケールまで）の複数の精度フォーマット（FP32、FP16（TensorRT）、Dynamic UINT8（ONNX）、Static INT8（TensorRT））における頑健性を評価する包括的な実証研究を提示します。また、Static INT8 PTQのための劣化対応キャリブレーション戦略を導入し、評価します。この戦略では、TensorRTのキャリブレーションプロセスにクリーンな画像と合成的に劣化させた画像の混合を曝露します。モデルは、COCOデータセットにおいて、7つの異なる劣化条件（様々な種類とレベルのノイズ、ぼかし、低コントラスト、JPEG圧縮を含む）および混合劣化シナリオでベンチマークされました。結果は、Static INT8 TensorRTエンジンがクリーンデータにおいて大幅な高速化（約1.5-3.3倍）と中程度の精度低下（約3-7% mAP50-95）を提供する一方で、提案された劣化対応キャリブレーションは、ほとんどのモデルと劣化において標準的なクライアントデータキャリブレーションよりも一貫した広範な頑健性の向上をもたらさなかったことを示しています。特定のノイズ条件下での大規模モデルにおいて注目すべき例外が観察され、モデルの容量がこのキャリブレーションアプローチの有効性に影響を与える可能性が示唆されました。これらの発見は、PTQの頑健性を向上させることの難しさを強調し、制御されていない環境での量子化検出器の展開に関する洞察を提供します。すべてのコードと評価表はhttps://github.com/AllanK24/QRIDで利用可能です。

English

Post-training quantization (PTQ) is crucial for deploying efficient object detection models, like YOLO, on resource-constrained devices. However, the impact of reduced precision on model robustness to real-world input degradations such as noise, blur, and compression artifacts is a significant concern. This paper presents a comprehensive empirical study evaluating the robustness of YOLO models (nano to extra-large scales) across multiple precision formats: FP32, FP16 (TensorRT), Dynamic UINT8 (ONNX), and Static INT8 (TensorRT). We introduce and evaluate a degradation-aware calibration strategy for Static INT8 PTQ, where the TensorRT calibration process is exposed to a mix of clean and synthetically degraded images. Models were benchmarked on the COCO dataset under seven distinct degradation conditions (including various types and levels of noise, blur, low contrast, and JPEG compression) and a mixed-degradation scenario. Results indicate that while Static INT8 TensorRT engines offer substantial speedups (~1.5-3.3x) with a moderate accuracy drop (~3-7% mAP50-95) on clean data, the proposed degradation-aware calibration did not yield consistent, broad improvements in robustness over standard clean-data calibration across most models and degradations. A notable exception was observed for larger model scales under specific noise conditions, suggesting model capacity may influence the efficacy of this calibration approach. These findings highlight the challenges in enhancing PTQ robustness and provide insights for deploying quantized detectors in uncontrolled environments. All code and evaluation tables are available at https://github.com/AllanK24/QRID.

入力劣化に対する量子化のロバスト性：物体検出のケーススタディ

Quantization Robustness to Input Degradations for Object Detection

要旨

Support