1. Jurafsky, D., & Martin, J. H. (2023). Speech and Language Processing (3rd ed. draft). Chapter 1, "Introduction," defines NLP as the field dedicated to computer processing of human language. Chapter 17, "Information Extraction," details the specific task of extracting structured data from unstructured text. (Available via Stanford University course pages.)
2. Manning, C. D., & Schütze, H. (1999). Foundations of Statistical Natural Language Processing. MIT Press. Chapter 1, Section 1.1, describes NLP's goal as designing algorithms that allow computers to process human language, including tasks like information extraction from documents.
3. Appelbaum, D., Kogan, A., & Vasarhelyi, M. A. (2018). How artificial intelligence is changing the audit process. Journal of Emerging Technologies in Accounting, 15(2), 1-18. The section "Textual Analysis" discusses how NLP and text mining are used to analyze unstructured data sources like contracts and legal documents to identify risks and key terms. (https://doi.org/10.2308/jeta-52232)