Segmentation Thermal
Back to index Back to Segmentation
Reference | Sensors | Semantics | Sensing Modality Representations | Fusion Operation and Method | Fusion Level | Dataset(s) used |
---|---|---|---|---|---|---|
Valada et al., 2019 [pdf][ref] | Visual camera, depth camera, thermal camera | Multiple 2D objects | RGB image, thermal image, depth image. Each processed by FCN with ResNet backbone (Adapnet++ architecture) | Extension of Mixture of Experts | Middle | Six datasets, including Cityscape, Sun RGB-D, etc. |
Sun et al., 2019 [pdf][ref] | Visual camera, thermal camera | Multiple 2D objects in campus environments | RGB image, thermal image. Each processed by a base network built on ResNet | Element-wise summation in the encoder networks | Middle | Datasets published by [ref] |
Guan et al., 2018 [pdf][ref] | Vision camera, thermal camera | 2D Pedestrian | RGB image, thermal image. Each processed by a base network built on VGG16 | Feature concatenation, Mixture of Experts | Early, Middle, Late | KAIST Pedestrian Dataset |
Ha et al., 2017 [pdf][ref] | Vision camera, thermal camera | Multiple 2D objects in campus environments | RGB image, thermal image. Each processed by a FCN and mini-inception block | Feature concatenation, addition (``short-cut fusion'') | Middle | self-recorded data |
Valada et al., 2017 [pdf][ref] | Vision camera, thermal camera | Multiple 2D objects | RGB image, thermal image, depth image. Each processed by FCN with ResNet backbone | Mixture of Experts | Late | Cityscape, Freiburg Multispectral Dataset, Synthia |
Valada et al., 2016 [pdf][ref] | Vision camera, thermal camera | Multiple 2D objects in forested environments | RGB image, thermal image, depth image. Each processed by the UpNet (built on VGG16 and up-convolution) | Feature concatenation, addition | Early, Late | self-recorded data |