Which activation is most used to mitigate vanishing gradients?
Answer options
A
Sigmoid
B
Tanh
C
ReLU
D
Linear
Correct answer: ReLU
Explanation
Quick AnswerThe correct answer is ReLU because it directly addresses the core logic of Generative AI.
The correct answer is: ReLU.