Image Restoration and Enhancement for Visual Recognition
Fall 2016 - Present
Description
To build a visual recognition system for any application these days, one's first inclination is to turn to the most recent machine learning breakthrough from the area of deep learning, which no doubt has been enabled by access to millions of training images from the Internet. But there are many circumstances where such an approach cannot be used as an off-the-shelf component to assemble the system we desire, because even the largest training dataset does not take into account all of the artifacts that can be experienced in the environment. As computer vision pushes further into real-world applications, what should a software system that can interpret images from sensors placed in any unrestrictedsetting actually look like?
First, it must incorporate a set of algorithms, drawn from the areas of computational photography and machine learning, into a processing pipeline that corrects and subsequently classifies images across time and space. Image restoration and enhancement algorithms that remove corruptions like blur, noise, and mis-focus, or manipulate images to gain resolution, change perspective, and compensate for lens distortion are now commonplace in photo editing tools. Such operations are necessary to improve the quality of raw images that are otherwise unacceptable for recognition purposes. But they must be compatible with the recognition process itself, and not adversely affect feature extraction or classification.
Remarkably, little thought has been given to image restoration and enhancement algorithms for visual recognition - the goal of computational photography thus far has simply been to make images look appealing after correction. It remains unknown what impact many transformations have on visual recognition algorithms. To begin to answer that question, exploratory work is needed to find out which image pre-processing algorithms, in combination with the strongest features and supervised machine learning approaches, are promising candidates for different problem domains.
This work was supported by IARPA contract #2016-16070500002 and the NVIDIA Corporation
Publications
- "Report on UG^2+ Challenge Track 1: Assessing Algorithms to Improve Video, , , ,
Object Detection and Classification from Unconstrained Mobility Platforms,"Computer Vision and Image Understanding (CVIU),December 2021.[pdf][bibtex]@article{banerjee2021report,
title={Report on UG^2+ Challenge Track 1: Assessing Algorithms to Improve
Video Object Detection and Classification from unconstrained mobility platforms},
author={Banerjee, Sreya and
VidalMata, Rosaura G and
Wang, Zhangyang and
Scheirer, Walter J},
journal={Computer Vision and Image Understanding},
volume={213},
pages={103297},
year={2021},
publisher={Elsevier}
}
- "Bridging the Gap Between Computational Photography and Visual, , , ,
Recognition,"
, , , , ,
, , , , , ,
, , , , ,
, , , ,IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI),December 2021.[pdf][bibtex]@article{vidalmata2021bridging,
title = {Bridging the Gap Between Computational Photography and Visual
Recognition},
author = {Rosaura G. VidalMata and
Sreya Banerjee and
Brandon RichardWebster and
Michael Albright and
Pedro Davalos and
Scott McCloskey and
Ben Miller and
Asong Tambo and
Sushobhan Ghosh and
Sudarshan Nagesh and
Ye Yuan and
Yueyu Hu and
Junru Wu and
Wenhan Yang and
Xiaoshuai Zhang and
Jiaying Liu and
Zhangyang Wang and
Hwann-Tzong Chen and
Tzu-Wei Huang and
Wen-Chi Chin and
Yi-Chun Li and
Mahmoud Lababidi and
Charles Otto and
Walter J. Scheirer},
journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI)},
volume = {43},
number = {12},
month = {December},
year = {2021}
}
- "Advancing Image Understanding in Poor Visibility Environments: A Collective, , , , , ,
Benchmark Study,"
,IEEE Transactions on Image Processing,March 2020.[pdf][bibtex]@article{yang2020advancing,
title={Advancing Image Understanding in Poor Visibility Environments: A Collective Benchmark Study},
author={Yang, Wenhan and
Yuan, Ye and
Ren, Wenqi and
Liu, Jiaying and
Scheirer, Walter and
Wang, Zhangyang and
others},
journal={IEEE Transactions on Image Processing},
volume={29},
pages={5737--5752},
year={2020},
publisher={IEEE}
}
- "UG^2: a Video Benchmark for Assessing the Impact of Image Restoration, , , , ,
and Enhancement on Automatic Visual Recognition,"Proceedings of the IEEE Winter Conference on Applications of Computer Vision (WACV),March 2018.[pdf] [dataset][bibtex]@InProceedings{vidal2018ug,
title={UG$^{2}$: a Video Benchmark for Assessing the Impact of Image Restoration and Enhancement on Automatic Visual Recognition},
author={Vidal, Rosaura G and Banerjee, Sreya and Grm, Klemen and Struc, Vitomir and Scheirer, Walter J},
booktitle={IEEE Winter Conference on Applications of Computer Vision (WACV)},
year={2018}
}