[CV] U-Net

[출처]: U-Net: Convolutional Networks for Biomedical Image Segmentation

unet.pdf

1.57MB

U-Net: 생의학 이미지 분할을 위한 컨볼루션 네트워크

저자:

Olaf Ronneberger, Philipp Fischer, Thomas Brox
독일 프라이부르크 대학교

요약:

이 논문은 생의학 이미지 분할을 위한 컨볼루션 네트워크 구조와 훈련 전략을 소개합니다. 주요 특징은 다음과 같습니다:

주석이 달린 샘플을 효율적으로 사용하기 위한 데이터 증강.
맥락을 포착하는 수축 경로와 정밀한 위치 지정이 가능한 확장 경로를 가진 대칭 아키텍처.
몇 개의 이미지로부터 엔드 투 엔드로 훈련 가능, 이전 방법보다 뛰어난 성능.
빠른 분할 능력 (최근 GPU에서 512x512 이미지 분할이 1초 이내).
구현 및 훈련된 네트워크는 온라인에서 제공.

소개:

최근 딥 컨볼루션 네트워크의 성공은 대규모 훈련 세트와 깊은 아키텍처 덕분.
생의학 이미지 분할은 일반 이미지 분류 작업과 달리 픽셀 수준의 분류가 필요.
이전 방법인 슬라이딩 윈도우 컨볼루션 네트워크는 느리고 위치 정확도와 맥락 사용 간의 트레이드오프가 존재.

아키텍처:

U-Net은 소수의 훈련 이미지로 작동하며 정밀한 분할을 제공하도록 완전 컨볼루션 네트워크를 수정.
아키텍처는 다음을 포함:
- 반복적인 컨볼루션과 맥스 풀링을 통한 맥락 캡처를 위한 수축 경로.
- 업샘플링과 연결을 통해 정밀한 위치 지정을 위한 확장 경로.
- 대칭 경로가 U자 모양을 형성.
- 완전히 연결된 레이어가 없어 오버랩 타일 전략을 통한 큰 이미지의 원활한 분할 가능.
- 탄성 변형을 포함한 광범위한 데이터 증강을 통해 불변성을 학습.

네트워크 훈련:

픽셀 단위의 소프트맥스와 교차 엔트로피 손실을 최적화하기 위해 높은 모멘텀으로 확률적 경사 하강법 사용.
손실 함수의 가중치를 통해 접촉하는 세포 사이의 분리를 강조.
가중치 초기화는 균형 잡힌 특징 맵 활성화를 보장.

실험:

EM 세분화 챌린지 (ISBI 2012): U-Net은 신경 구조 분할에서 최첨단 결과를 달성, 더 낮은 왜곡 및 Rand 오류.
ISBI 세포 추적 챌린지 2015:
- "PhC-U373" 데이터셋: 92% IOU 달성, 두 번째로 좋은 알고리즘보다 훨씬 우수.
- "DIC-HeLa" 데이터셋: 77.5% IOU 달성, 두 번째로 좋은 알고리즘보다 훨씬 우수.

결론:

U-Net은 다양한 생의학 분할 작업에서 매우 우수한 성능.
최소한의 주석이 달린 이미지로 충분히 훈련 가능, 합리적인 훈련 시간.
구현 및 훈련된 모델은 공개되어 있어 다른 작업에 쉽게 적용 가능.

감사의 말:

독일 연방 및 주 정부의 Excellence Initiative 및 BMBF의 지원을 받음.

Summary of U-Net: Convolutional Networks for Biomedical Image Segmentation

Authors:

Olaf Ronneberger, Philipp Fischer, and Thomas Brox
University of Freiburg, Germany

Abstract:

The paper introduces a convolutional network architecture and training strategy for biomedical image segmentation. Key features include:

Data augmentation to efficiently use annotated samples.
A symmetric architecture with a contracting path for context and an expanding path for precise localization.
The ability to train end-to-end from few images, outperforming previous methods.
Fast segmentation capabilities (less than a second for a 512x512 image on a recent GPU).
Implementation and trained networks available online.

Introduction:

Recent success in deep convolutional networks is due to large training sets and deep architectures.
Biomedical image segmentation requires pixel-level classification, unlike typical image classification tasks.
Prior methods like sliding-window convolutional networks are slow and have localization-context trade-offs.

Architecture:

U-Net modifies the fully convolutional network to work with few training images, providing precise segmentations.
The architecture includes:
- A contracting path with repeated convolutions and max pooling for context capture.
- An expanding path with upsampling and concatenation for precise localization.
- Symmetric paths forming a U-shape.
- No fully connected layers, allowing seamless segmentation of large images via overlap-tile strategy.
- Extensive data augmentation, particularly elastic deformations, to learn invariances.

Network Training:

Uses stochastic gradient descent with high momentum to optimize the pixel-wise soft-max and cross-entropy loss.
Weighted loss function emphasizes separation between touching cells.
Initialization of weights ensures balanced feature map activations.

Experiments:

EM Segmentation Challenge (ISBI 2012): U-Net achieved state-of-the-art results on neuronal structures segmentation with lower warping and Rand errors.
ISBI Cell Tracking Challenge 2015:
- "PhC-U373" dataset: Achieved 92% IOU, significantly better than the second-best algorithm.
- "DIC-HeLa" dataset: Achieved 77.5% IOU, also significantly better than the second-best algorithm.

Conclusion:

U-Net performs exceptionally well across various biomedical segmentation tasks.
Requires minimal annotated images and has a reasonable training time.
Implementation and trained models are publicly available, allowing easy application to other tasks.

Acknowledgements:

Supported by the Excellence Initiative of the German Federal and State governments and the BMBF.

저작자표시 비영리 변경금지 (새창열림)

'AI 논문 > Computer Vision' 카테고리의 다른 글

[CV] CvT (1)	2024.06.07
[CV] Deformable Convolutional Networks (0)	2024.06.07
[CV] YOLO v2 (0)	2024.06.07
[CV] YOLO (1)	2024.06.07
[CV] Cascade R-CNN (0)	2024.06.07

cogito30's AI Develope Blog

[CV] U-Net

U-Net: 생의학 이미지 분할을 위한 컨볼루션 네트워크

저자:

요약:

소개:

아키텍처:

네트워크 훈련:

실험:

결론:

감사의 말:

Summary of U-Net: Convolutional Networks for Biomedical Image Segmentation

Authors:

Abstract:

Introduction:

Architecture:

Network Training:

Experiments:

Conclusion:

Acknowledgements:

'AI 논문 > Computer Vision' 카테고리의 다른 글

티스토리툴바

[CV] U-Net

U-Net: 생의학 이미지 분할을 위한 컨볼루션 네트워크

저자:

요약:

소개:

아키텍처:

네트워크 훈련:

실험:

결론:

감사의 말:

Summary of U-Net: Convolutional Networks for Biomedical Image Segmentation

Authors:

Abstract:

Introduction:

Architecture:

Network Training:

Experiments:

Conclusion:

Acknowledgements:

'AI 논문 > Computer Vision' 카테고리의 다른 글

관련글

티스토리툴바