Natural Language Processing
History
Main article: History of natural language processing
The history of NLP generally starts in the 1950s, although work can be found from earlier
periods. In 1950, Alan Turing published an article titled "Computing Machinery and Intelligence"
which proposed what is now called the Turing test as a criterion of intelligence.
The Georgetown experiment in 1954 involved fully automatic translation of more than sixty
Russian sentences into English. The authors claimed that within three or five years, machine
translation would be a solved problem.[2] However, real progress was much slower, and after the
ALPAC report in 1966, which found that ten years of research had failed to fulfill expectations,
funding for machine translation was dramatically reduced. Little further research in
machine translation was conducted until the late 1980s, when the first statistical machine
translation systems were developed.
Some notably successful NLP systems developed in the 1960s were SHRDLU, a natural
language system working in restricted "blocks worlds" with restricted vocabularies, and ELIZA,
a simulation of a Rogerian psychotherapist, written by Joseph Weizenbaum between 1964 and
1966. Using almost no information about human thought or emotion, ELIZA sometimes
provided a startlingly human-like interaction. When the "patient" exceeded the very small
knowledge base, ELIZA might provide a generic response, for example, responding to "My head
hurts" with "Why do you say your head hurts?".
During the 1970s many programmers began to write 'conceptual ontologies', which structured
real-world information into computer-understandable data. Examples are MARGIE (Schank,
1975), SAM (Cullingford, 1978), PAM (Wilensky, 1978), TaleSpin (Meehan, 1976), QUALM
(Lehnert, 1977), Politics (Carbonell, 1979), and Plot Units (Lehnert 1981). During this time,
many chatterbots were written including PARRY, Racter, and Jabberwacky.
Up to the 1980s, most NLP systems were based on complex sets of hand-written rules. Starting
in the late 1980s, however, there was a revolution in NLP with the introduction of machine
learning algorithms for language processing. This was due to both the steady increase in
computational power resulting from Moore's Law and the gradual lessening of the dominance of
Chomskyan theories of linguistics (e.g. transformational grammar), whose theoretical
underpinnings discouraged the sort of corpus linguistics that underlies the machine-learning
approach to language processing.[3] Some of the earliest-used machine learning algorithms, such
as decision trees, produced systems of hard if-then rules similar to existing hand-written rules.
However, part-of-speech tagging introduced the use of hidden Markov models to NLP, and
increasingly, research has focused on statistical models, which make soft, probabilistic decisions
based on attaching real-valued weights to the features making up the input data. The cache
language models upon which many speech recognition systems now rely are examples of such
statistical models. Such models are generally more robust when given unfamiliar input,
especially input that contains errors (as is very common for real-world data), and produce more
reliable results when integrated into a larger system comprising multiple subtasks.
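As an illustration of the kind of statistical model described above, the following sketch tags a short sentence with a toy hidden Markov model decoded by the Viterbi algorithm. The tag set, transition probabilities, and emission probabilities are all invented for illustration; a real tagger would estimate them from an annotated corpus.

```python
# Minimal Viterbi decoder for a toy HMM part-of-speech tagger.
# All probabilities below are invented for illustration only.
import math

tags = ["DET", "NOUN", "VERB"]

# P(tag | previous tag), with "<s>" as the start state.
trans = {
    "<s>":  {"DET": 0.6,  "NOUN": 0.3,  "VERB": 0.1},
    "DET":  {"DET": 0.05, "NOUN": 0.85, "VERB": 0.10},
    "NOUN": {"DET": 0.10, "NOUN": 0.30, "VERB": 0.60},
    "VERB": {"DET": 0.50, "NOUN": 0.40, "VERB": 0.10},
}

# P(word | tag); unseen words get a small smoothing probability.
emit = {
    "DET":  {"the": 0.7, "a": 0.3},
    "NOUN": {"dog": 0.4, "walk": 0.2, "park": 0.4},
    "VERB": {"walks": 0.5, "walk": 0.3, "barks": 0.2},
}
SMOOTH = 1e-6

def viterbi(words):
    """Return the most probable tag sequence for `words` under the toy HMM."""
    # best[i][t] = (log-probability of the best path ending in tag t at position i, backpointer)
    best = [{} for _ in words]
    for t in tags:
        best[0][t] = (math.log(trans["<s>"][t]) + math.log(emit[t].get(words[0], SMOOTH)), None)
    for i in range(1, len(words)):
        for t in tags:
            e = emit[t].get(words[i], SMOOTH)
            score, prev = max(
                (best[i - 1][p][0] + math.log(trans[p][t]) + math.log(e), p)
                for p in tags
            )
            best[i][t] = (score, prev)
    # Trace back from the best final tag.
    last = max(tags, key=lambda t: best[-1][t][0])
    path = [last]
    for i in range(len(words) - 1, 0, -1):
        path.append(best[i][path[-1]][1])
    return list(reversed(path))

print(viterbi("the dog walks".split()))   # -> ['DET', 'NOUN', 'VERB']
```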
Many of the notable early successes occurred in the field of machine translation, due especially to
work at IBM Research, where successively more complicated statistical models were developed.
These systems were able to take advantage of existing multilingual textual corpora that had been
produced by the Parliament of Canada and the European Union as a result of laws calling for the
translation of all governmental proceedings into all official languages of the corresponding
systems of government. However, most other systems depended on corpora specifically
developed for the tasks implemented by these systems, which was (and often continues to be) a
major limitation in the success of these systems. As a result, a great deal of research has gone
into methods of more effectively learning from limited amounts of data.
Recent research has increasingly focused on unsupervised and semi-supervised learning
algorithms. Such algorithms are able to learn from data that has not been hand-annotated with the
desired answers, or from a combination of annotated and non-annotated data. Generally, this task
is much more difficult than supervised learning, and typically produces less accurate results for a
given amount of input data. However, there is an enormous amount of non-annotated data
available (including, among other things, the entire content of the World Wide Web), which can
often make up for the inferior results.
Modern NLP algorithms are based on machine learning, especially statistical machine learning.
The paradigm of machine learning is different from that of most prior attempts at language
processing. Prior implementations of language-processing tasks typically involved the direct
hand coding of large sets of rules. The machine-learning paradigm calls instead for using general
learning algorithms (often, although not always, grounded in statistical inference) to
automatically learn such rules through the analysis of large corpora of typical real-world
examples. A corpus (plural, "corpora") is a set of documents (or sometimes, individual
sentences) that have been hand-annotated with the correct values to be learned.
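To make the idea of learning from a hand-annotated corpus concrete, the sketch below uses a tiny invented corpus of (word, tag) pairs and derives a simple "rule" (the most frequent tag for each word) purely by counting. The sentences, words, and tags are hypothetical; they stand in for the large real-world corpora described above.

```python
# A tiny hand-annotated corpus: each sentence is a list of (word, tag) pairs.
# The sentences and tags here are invented purely for illustration.
from collections import Counter, defaultdict

corpus = [
    [("the", "DET"), ("dog", "NOUN"), ("barks", "VERB")],
    [("a", "DET"), ("dog", "NOUN"), ("walks", "VERB")],
    [("the", "DET"), ("walk", "NOUN"), ("ends", "VERB")],
    [("dogs", "NOUN"), ("walk", "VERB")],
]

# Count how often each word is annotated with each tag.
tag_counts = defaultdict(Counter)
for sentence in corpus:
    for word, tag in sentence:
        tag_counts[word][tag] += 1

# "Learned rule": tag each known word with its most frequent tag in the corpus.
most_frequent_tag = {w: c.most_common(1)[0][0] for w, c in tag_counts.items()}

print(most_frequent_tag["dog"])    # 'NOUN' -- a rule recovered from counts, not hand-coded
print(tag_counts["walk"])          # 'walk' is ambiguous here (seen as both NOUN and VERB)
```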
Many different classes of machine learning algorithms have been applied to NLP tasks. These
algorithms take as input a large set of "features" that are generated from the input data. Some of
the earliest-used algorithms, such as decision trees, produced systems of hard if-then rules similar
to the systems of hand-written rules that were then common. Increasingly, however, research has
focused on statistical models, which make soft, probabilistic decisions based on attaching
real-valued weights to each input feature. Such models have the advantage that they can express
the relative certainty of many different possible answers rather than only one, producing more
reliable results when such a model is included as a component of a larger system.
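As a simplified illustration of attaching real-valued weights to input features and making soft, probabilistic decisions, the sketch below scores candidate part-of-speech tags for a word as weighted sums of binary features and normalizes the scores into a probability distribution with a softmax. The feature names and weights are invented, hand-set numbers; in a real system they would be learned from an annotated corpus (for example by a maximum-entropy / logistic-regression model).

```python
# Soft, probabilistic decision over candidate tags for one word, using
# hand-set, real-valued feature weights (purely illustrative).
import math

def features(word, prev_tag):
    """Binary features describing the word in context."""
    return {
        "ends_in_s": word.endswith("s"),
        "prev_is_DET": prev_tag == "DET",
        "is_capitalized": word[:1].isupper(),
    }

# One real-valued weight per (feature, candidate tag) pair -- invented numbers.
weights = {
    "NOUN": {"ends_in_s": 0.5,  "prev_is_DET": 2.0,  "is_capitalized": 1.0},
    "VERB": {"ends_in_s": 1.5,  "prev_is_DET": -1.0, "is_capitalized": -0.5},
    "ADJ":  {"ends_in_s": -0.5, "prev_is_DET": 0.5,  "is_capitalized": 0.0},
}

def predict(word, prev_tag):
    """Return a probability for every candidate tag, not just a single answer."""
    feats = features(word, prev_tag)
    scores = {
        tag: sum(w for name, w in tag_weights.items() if feats[name])
        for tag, tag_weights in weights.items()
    }
    z = sum(math.exp(s) for s in scores.values())          # softmax normalizer
    return {tag: math.exp(s) / z for tag, s in scores.items()}

print(predict("walks", prev_tag="DET"))
# roughly {'NOUN': 0.82, 'VERB': 0.11, 'ADJ': 0.07} -- the relative certainty is
# preserved, so a larger system can weigh the alternatives instead of being
# handed a single hard choice.
```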
Systems based on machine-learning algorithms have many advantages over hand-produced rules:
The learning procedures used during machine learning automatically focus on the most
common cases, whereas when writing rules by hand it is often not obvious at all where the
effort should be directed.
Automatic learning procedures can make use of statistical inference algorithms to produce
models that are robust to unfamiliar input (e.g. containing words or structures that have not
been seen before) and to erroneous input (e.g. with misspelled words or words accidentally
omitted). Generally, handling such input gracefully with hand-written rules (or, more generally,
creating systems of hand-written rules that make soft decisions) is extremely difficult,
error-prone and time-consuming.
Systems based on automatically learning the rules can be made more accurate simply by
supplying more input data. However, systems based on hand-written rules can only be made
more accurate by increasing the complexity of the rules, which is a much more difficult task. In
particular, there is a limit to the complexity of systems based on hand-crafted rules, beyond
which the systems become more and more unmanageable. By contrast, creating more data as
input to machine-learning systems simply requires a corresponding increase in the number of
man-hours worked, generally without significant increases in the complexity of the annotation
process.
The subfield of NLP devoted to learning approaches is known as Natural Language Learning
(NLL), and its conference CoNLL and peak body SIGNLL are sponsored by ACL, recognizing
their links with Computational Linguistics and Language Acquisition. When the aim of
computational language learning research is to understand more about human language
acquisition, or psycholinguistics, NLL overlaps with the related field of Computational
Psycholinguistics.