Segmentation Thermal

Back to index Back to Segmentation

Reference	Sensors	Semantics	Sensing Modality Representations	Fusion Operation and Method	Fusion Level	Dataset(s) used
Valada et al., 2019 [pdf][ref]	Visual camera, depth camera, thermal camera	Multiple 2D objects	RGB image, thermal image, depth image. Each processed by FCN with ResNet backbone (Adapnet++ architecture)	Extension of Mixture of Experts	Middle	Six datasets, including Cityscape, Sun RGB-D, etc.
Sun et al., 2019 [pdf][ref]	Visual camera, thermal camera	Multiple 2D objects in campus environments	RGB image, thermal image. Each processed by a base network built on ResNet	Element-wise summation in the encoder networks	Middle	Datasets published by [ref]
Guan et al., 2018 [pdf][ref]	Vision camera, thermal camera	2D Pedestrian	RGB image, thermal image. Each processed by a base network built on VGG16	Feature concatenation, Mixture of Experts	Early, Middle, Late	KAIST Pedestrian Dataset
Ha et al., 2017 [pdf][ref]	Vision camera, thermal camera	Multiple 2D objects in campus environments	RGB image, thermal image. Each processed by a FCN and mini-inception block	Feature concatenation, addition (``short-cut fusion'')	Middle	self-recorded data
Valada et al., 2017 [pdf][ref]	Vision camera, thermal camera	Multiple 2D objects	RGB image, thermal image, depth image. Each processed by FCN with ResNet backbone	Mixture of Experts	Late	Cityscape, Freiburg Multispectral Dataset, Synthia
Valada et al., 2016 [pdf][ref]	Vision camera, thermal camera	Multiple 2D objects in forested environments	RGB image, thermal image, depth image. Each processed by the UpNet (built on VGG16 and up-convolution)	Feature concatenation, addition	Early, Late	self-recorded data