基于深度学习的无监督领域自适应语义分割算法综述-AET-电子技术应用

基于深度学习的无监督领域自适应语义分割算法综述

电子技术应用

应俊杰1，2，楼陆飞1，2，辛宇1，2

1.宁波大学信息科学与工程学院，浙江宁波315211；2.浙江省移动网应用技术重点实验室，浙江宁波315211

摘要： 随着现代生活逐步智能化，越来越多的应用需要从图像中推断相应的语义信息再进行后续的处理，如虚拟现实、自动驾驶和视频监控等应用。目前的语义分割模型利用大量标注数据进行有监督训练能达到理想的性能，但模型对与训练数据不同分布的数据进行推理时，其性能严重下降。这意味着一旦应用场景发生变化，就需对新场景的数据进行标注。模型重新利用新数据进行训练，才能达到正常的性能。这无疑是耗时的、代价昂贵的。为此，领域自适应语义分割算法提供了解决模型在分布不一致数据上语义分割性能下降问题的思路。总结了领域自适应语义分割算法的前沿进展，并对未来研究方向进行展望。

关键词： 领域自适应语义分割深度学习

中图分类号：TP391.4 文献标志码：A DOI: 10.16157/j.issn.0258-7998.234261
中文引用格式： 应俊杰，楼陆飞，辛宇. 基于深度学习的无监督领域自适应语义分割算法综述[J]. 电子技术应用，2024，50(1)：1-9.
英文引用格式： Ying Junjie，Lou Lufei，Xin Yu. A survey of unsupervised domain adaptive semantic segmentation algorithms based on deep learning[J]. Application of Electronic Technique，2024，50(1)：1-9.

A survey of unsupervised domain adaptive semantic segmentation algorithms based on deep learning

Ying Junjie1，2，Lou Lufei1，2，Xin Yu1，2

1.College of Information Science and Engineering， Ningbo University， Ningbo 315211， China； 2.Key Laboratory of Mobile Network Application Technology of Zhejiang Province， Ningbo 315211， China

Abstract： As modern life becomes increasingly intelligent, more and more applications require inferring semantic information from images before proceeding with further processing, such as virtual reality, autonomous driving, and video surveillance. Current semantic segmentation models achieve ideal performance through supervised training with a large amount of annotated data, but their performance severely deteriorates when inferring on data with a distribution different from the training data. This means that once the application scenario changes, new data needs to be annotated and the model needs to be retrained with the new data in order to achieve normal performance. This is undoubtedly time-consuming and expensive. Therefore, domain adaptive semantic segmentation algorithms provide a solution to the problem of the model's performance degradation on data with different distributions. This article summarizes the cutting-edge progress of domain adaptive semantic segmentation algorithms and looks forward to future research directions.

Key words : domain adaptive；semantic segmentation；deep learning

引言

语义分割是计算机视觉的基础任务之一，它为图像的每个像素进行类别预测，目的是将图像分割成若干个带有语义的感兴趣区域，以便后续的图像理解和分析工作，推动了自动驾驶、虚拟现实、医学影像分析和卫星成像等领域的发展。近几年来，语义分割模型的性能有着巨大的提升。然而，模型的性能依赖于大量人工标注的训练数据，这些数据的标注是十分耗时且代价昂贵的，纯人工标注一张图的时间甚至可能超过一个小时。即使现在使用半自动化标注工具自动生成一部分标注，可以减少标注的时间，但仍然需要人工去调整和检查自动生成的标注。语义分割模型需要在与训练数据分布一致的数据上才能获得优异的性能，而为另一不同分布的数据进行语义标注的代价很大。

本文详细内容请下载：

https://www.chinaaet.com/resource/share/2000005825

作者信息：

应俊杰1，2，楼陆飞1，2，辛宇1，2

（1.宁波大学信息科学与工程学院，浙江宁波315211；2.浙江省移动网应用技术重点实验室，浙江宁波315211）

原创声明：此内容为AET网站原创，未经授权禁止转载。

相关内容