CMU 11-785 L10 CNN architecture

最新推荐文章于 2026-06-17 17:11:48 发布

原创最新推荐文章于 2026-06-17 17:11:48 发布 · 294 阅读

·

0

·

本内容遵循CC 4.0 BY-SA版权协议

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

标签

#filter #深度学习 #数据挖掘 #机器学习 #神经网络

CMU 11-785 专栏收录该内容

22 篇文章

订阅专栏

本文深入探讨了卷积神经网络的架构，包括卷积层和下采样层的工作原理，解释了扫描图像的过滤器概念，以及如何通过调整步长和过滤器大小来改变输出尺寸。同时，文章还讨论了如何保持图像大小不变，以及如何通过增加过滤器数量来捕捉更复杂的模式。

Architecture

在这里插入图片描述

A convolutional neural network comprises “convolutional” and “downsampling ” layers
- Convolutional layers comprise neurons that scan their input for patterns
- Downsampling layers perform max operations on groups of outputs from the convolutional layers
  - Perform on individual map
  - For reduce the number of parameters
The two may occur in any sequence, but typically they alternate
Followed by an MLP with one or more layers

A convolutional layer

Each activation map has two components
- An affine map, obtained by convolution over maps in the previous layer
  - Each affine map has, associated with it, a learnable filter
- An activation that operates on the output of the convolution
What is a convolution
- Scanning an image with a “filter”
- Equivalent to scanning with an MLP
Weights
- size of the filter $×\times$ no. of maps in previous layer
Size
- Image size: $N×NN\times N$
- Filter: $M×MM\times M$
- Stride: $S$
- Output size = $⌊(N−M)/S⌋+1\lfloor(N-M) / S\rfloor+1$
Jargon
- Filters are often called “Kernels”
- The outputs of individual filters are called “channels”

Notion

Each convolution layer maintains the size of the image
- With appropriate zero padding
- If performed without zero padding it will decrease the size of the input
Each convolution layer may increase the number of maps from the previous layer
- Depends on the number of filters
Each pooling layer with hop $D$ decreases the size of the maps by a factor of $D$
Filters within a layer must all be the same size, but sizes may vary with layer
- Similarly for pooling, $D$ may vary with layer
In general the number of convolutional filters increases with layers
- Because the patterns gets more complex, hence larger combinations of patterns to capture
Training is as in the case of the regular MLP
- The only difference is in the structure of the network

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。