Neuron-level Interpretation of Deep NLP Models: A Survey
<p dir="ltr">The proliferation of Deep Neural Networks in various domains has seen an increased need for interpretability of these models. Preliminary work done along this line, and papers that surveyed such, are focused on high-level representation analysis. However, a recent branch...
محفوظ في:
| المؤلف الرئيسي: | |
|---|---|
| مؤلفون آخرون: | , |
| منشور في: |
2022
|
| الموضوعات: | |
| الوسوم: |
إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
|
| _version_ | 1864513518552743936 |
|---|---|
| author | Hassan Sajjad (5297441) |
| author2 | Nadir Durrani (5297438) Fahim Dalvi (18427905) |
| author2_role | author author |
| author_facet | Hassan Sajjad (5297441) Nadir Durrani (5297438) Fahim Dalvi (18427905) |
| author_role | author |
| dc.creator.none.fl_str_mv | Hassan Sajjad (5297441) Nadir Durrani (5297438) Fahim Dalvi (18427905) |
| dc.date.none.fl_str_mv | 2022-11-22T03:00:00Z |
| dc.identifier.none.fl_str_mv | 10.1162/tacl_a_00519 |
| dc.relation.none.fl_str_mv | https://figshare.com/articles/journal_contribution/Neuron-level_Interpretation_of_Deep_NLP_Models_A_Survey/25672512 |
| dc.rights.none.fl_str_mv | CC BY 4.0 info:eu-repo/semantics/openAccess |
| dc.subject.none.fl_str_mv | Information and computing sciences Artificial intelligence Machine learning Language, communication and culture Linguistics Deep Neural Networks Interpretability High-level representation analysis Neuron analysis Evaluation methods |
| dc.title.none.fl_str_mv | Neuron-level Interpretation of Deep NLP Models: A Survey |
| dc.type.none.fl_str_mv | Text Journal contribution info:eu-repo/semantics/publishedVersion text contribution to journal |
| description | <p dir="ltr">The proliferation of Deep Neural Networks in various domains has seen an increased need for interpretability of these models. Preliminary work done along this line, and papers that surveyed such, are focused on high-level representation analysis. However, a recent branch of work has concentrated on interpretability at a more granular level of analyzing neurons within these models. In this paper, we survey the work done on neuron analysis including: i) methods to discover and understand neurons in a network; ii) evaluation methods; iii) major findings including cross architectural comparisons that neuron analysis has unraveled; iv) applications of neuron probing such as: controlling the model, domain adaptation, and so forth; and v) a discussion on open issues and future research directions.</p><h2>Other Information</h2><p dir="ltr">Published in: Transactions of the Association for Computational Linguistics<br>License: <a href="https://creativecommons.org/licenses/by/4.0/" target="_blank">https://creativecommons.org/licenses/by/4.0/</a><br>See article on publisher's website: <a href="https://dx.doi.org/10.1162/tacl_a_00519" target="_blank">https://dx.doi.org/10.1162/tacl_a_00519</a></p> |
| eu_rights_str_mv | openAccess |
| id | Manara2_7146852c285ece4a75f712425af72200 |
| identifier_str_mv | 10.1162/tacl_a_00519 |
| network_acronym_str | Manara2 |
| network_name_str | Manara2 |
| oai_identifier_str | oai:figshare.com:article/25672512 |
| publishDate | 2022 |
| repository.mail.fl_str_mv | |
| repository.name.fl_str_mv | |
| repository_id_str | |
| rights_invalid_str_mv | CC BY 4.0 |
| spelling | Neuron-level Interpretation of Deep NLP Models: A SurveyHassan Sajjad (5297441)Nadir Durrani (5297438)Fahim Dalvi (18427905)Information and computing sciencesArtificial intelligenceMachine learningLanguage, communication and cultureLinguisticsDeep Neural NetworksInterpretabilityHigh-level representation analysisNeuron analysisEvaluation methods<p dir="ltr">The proliferation of Deep Neural Networks in various domains has seen an increased need for interpretability of these models. Preliminary work done along this line, and papers that surveyed such, are focused on high-level representation analysis. However, a recent branch of work has concentrated on interpretability at a more granular level of analyzing neurons within these models. In this paper, we survey the work done on neuron analysis including: i) methods to discover and understand neurons in a network; ii) evaluation methods; iii) major findings including cross architectural comparisons that neuron analysis has unraveled; iv) applications of neuron probing such as: controlling the model, domain adaptation, and so forth; and v) a discussion on open issues and future research directions.</p><h2>Other Information</h2><p dir="ltr">Published in: Transactions of the Association for Computational Linguistics<br>License: <a href="https://creativecommons.org/licenses/by/4.0/" target="_blank">https://creativecommons.org/licenses/by/4.0/</a><br>See article on publisher's website: <a href="https://dx.doi.org/10.1162/tacl_a_00519" target="_blank">https://dx.doi.org/10.1162/tacl_a_00519</a></p>2022-11-22T03:00:00ZTextJournal contributioninfo:eu-repo/semantics/publishedVersiontextcontribution to journal10.1162/tacl_a_00519https://figshare.com/articles/journal_contribution/Neuron-level_Interpretation_of_Deep_NLP_Models_A_Survey/25672512CC BY 4.0info:eu-repo/semantics/openAccessoai:figshare.com:article/256725122022-11-22T03:00:00Z |
| spellingShingle | Neuron-level Interpretation of Deep NLP Models: A Survey Hassan Sajjad (5297441) Information and computing sciences Artificial intelligence Machine learning Language, communication and culture Linguistics Deep Neural Networks Interpretability High-level representation analysis Neuron analysis Evaluation methods |
| status_str | publishedVersion |
| title | Neuron-level Interpretation of Deep NLP Models: A Survey |
| title_full | Neuron-level Interpretation of Deep NLP Models: A Survey |
| title_fullStr | Neuron-level Interpretation of Deep NLP Models: A Survey |
| title_full_unstemmed | Neuron-level Interpretation of Deep NLP Models: A Survey |
| title_short | Neuron-level Interpretation of Deep NLP Models: A Survey |
| title_sort | Neuron-level Interpretation of Deep NLP Models: A Survey |
| topic | Information and computing sciences Artificial intelligence Machine learning Language, communication and culture Linguistics Deep Neural Networks Interpretability High-level representation analysis Neuron analysis Evaluation methods |