- An LSTM has 4 gates: feature, input, output, forget. We might ask: are all 4 necessary? What if I remove one? Indeed, lots of experimentation has gone into LSTM variants, the GRU being a notable example (which is simpler).
- If certain tricks are used to get an algorithm to work, it’s useful to know whether the algorithm is robust to removing these tricks. For example, DeepMind’s original DQN paper reports using (1) only periodically updating the reference network and (2) using a replay buffer rather than updating online. It’s very useful for the research community to know that both these tricks are necessary, in order to build on top of these results.
- If an algorithm is a modification of a previous work, and has multiple differences, researchers want to know what the key difference is.
- Simpler is better (inductive prior towards simpler model classes). If you can get the same performance with two models, prefer the simpler one.
An ablation study typically refers to removing some “feature” of the model or algorithm, and seeing how that affects performance.
모델이나 알고리즘의 특징들을 제거하면서 그게 퍼포먼스에 어떤 영향을 줄지 연구하는 거
https://www.quora.com/In-the-context-of-deep-learning-what-is-an-ablation-study
For instance,
Ablation studies by digitally removing whole fish or parts of the fish from the images revealed that the classifier learned discriminative features from the image foreground, and we observed that the deformations of the head region, rather than the visually apparent bent tail, were more important for good classification performance
or
we ablated (i.e. removed) some of the repeated elements to see if the DNN confidence score for that image drops. Psychologists use the same ablation technique to learn which image features humans use to recognize objects
'Data-science > 논문 읽기' 카테고리의 다른 글
gan 논문에 등장하는 artifact 뜻? (0) | 2020.10.04 |
---|---|
논문에 등장하는 revisit 뜻? (0) | 2020.10.04 |
[Deep learning 논문 읽기] StyleGAN loss 이해, 얕게 읽는 WGAN, WGAN-GP (0) | 2020.09.01 |
[Deep learning 논문 읽기] style-gan 1 (0) | 2020.08.26 |
[딥러닝 논문 리뷰] A Spatio-Temporal Spot-Forecasting Framework for Urban Traffic Prediction - 1 (0) | 2020.06.22 |