Revision as of 19:43, 10 October 2024 edit Cosmia Nebula (talk \| contribs) Extended confirmed users 11,304 edits →Adaptive Tag: Visual edit ← Previous edit		Revision as of 02:20, 14 October 2024 edit undo Cosmia Nebula (talk \| contribs) Extended confirmed users 11,304 edits →Local response normalization: global Tag: Visual edit Next edit →
Line 171: There are some activation normalization techniques that are only used for CNNs. === ~~Local response~~Response normalization === '''Local response normalization'''<ref>{{Cite journal \|last1=Krizhevsky \|first1=Alex \|last2=Sutskever \|first2=Ilya \|last3=Hinton \|first3=Geoffrey E \|date=2012 \|title=ImageNet Classification with Deep Convolutional Neural Networks \|url=https://papers.nips.cc/paper_files/paper/2012/hash/c399862d3b9d6b76c8436e924a68c45b-Abstract.html \|journal=Advances in Neural Information Processing Systems \|publisher=Curran Associates, Inc. \|volume=25}}</ref> was used in [[AlexNet]]. It was applied in a convolutional layer, just after a nonlinear activation function. It was defined by<math display="block">b_{x, y}^i=\frac{a_{x, y}^i}{\left(k+\alpha \sum_{j=\max (0, i-n / 2)}^{\min (N-1, i+n / 2)}\left(a_{x, y}^j\right)^2\right)^\beta}</math>where <math>a_{x,y}^i</math> is the activation of the neuron at ___location <math>(x,y)</math> and channel <math>i</math>. In words, each pixel in a channel is suppressed by the activations of the same pixel in its adjacent channels. Line 181: Both kinds of local normalization were obsoleted by batch normalization, which is a more global form of normalization.<ref>{{Cite journal \|last1=Ortiz \|first1=Anthony \|last2=Robinson \|first2=Caleb \|last3=Morris \|first3=Dan \|last4=Fuentes \|first4=Olac \|last5=Kiekintveld \|first5=Christopher \|last6=Hassan \|first6=Md Mahmudulla \|last7=Jojic \|first7=Nebojsa \|date=2020 \|title=Local Context Normalization: Revisiting Local Normalization \|url=https://openaccess.thecvf.com/content_CVPR_2020/html/Ortiz_Local_Context_Normalization_Revisiting_Local_Normalization_CVPR_2020_paper.html \|pages=11276–11285\|arxiv=1912.05845 }}</ref> Response normalization reappeared in ConvNeXT-2 as '''global response normalization'''.<ref>{{Cite journal \|last=Woo \|first=Sanghyun \|last2=Debnath \|first2=Shoubhik \|last3=Hu \|first3=Ronghang \|last4=Chen \|first4=Xinlei \|last5=Liu \|first5=Zhuang \|last6=Kweon \|first6=In So \|last7=Xie \|first7=Saining \|date=2023 \|title=ConvNeXt V2: Co-Designing and Scaling ConvNets With Masked Autoencoders \|url=https://openaccess.thecvf.com/content/CVPR2023/html/Woo_ConvNeXt_V2_Co-Designing_and_Scaling_ConvNets_With_Masked_Autoencoders_CVPR_2023_paper.html \|language=en \|pages=16133–16142}}</ref> === Group normalization ===

Normalization (machine learning): Difference between revisions