English Sentence Part of Speech Tagset Reference202


In computational linguistics, part-of-speech tagging (POS tagging) is the process of assigning a grammatical category (part of speech) to each word in a sentence. This information is used in natural language processing (NLP) tasks such as parsing, named entity recognition, and machine translation.

The English Sentence Part of Speech Tagset is a set of 45 tags that are used to annotate the parts of speech in English sentences. The tagset was developed in the 1980s and 1990s and is widely used in NLP research and development.

The following table lists the tags in the English Sentence Part of Speech Tagset, along with their descriptions:| Tag | Description |
|---|---|
| CC | Coordinating conjunction |
| CD | Cardinal number |
| DT | Determiner |
| EX | Existential there |
| FW | Foreign word |
| IN | Preposition or subordinating conjunction |
| JJ | Adjective |
| JJR | Comparative adjective |
| JJS | Superlative adjective |
| LS | List item marker |
| MD | Modal verb |
| NN | Common noun, singular |
| NNS | Common noun, plural |
| NNP | Proper noun, singular |
| NNPS | Proper noun, plural |
| PDT | Predeterminer |
| POS | Possessive ending |
| PRP | Personal pronoun |
| PRPS | Possessive pronoun |
| RB | Adverb |
| RBR | Comparative adverb |
| RBS | Superlative adverb |
| RP | Particle |
| TO | To |
| UH | Interjection |
| VB | Verb, base form |
| VBD | Verb, past tense |
| VBG | Verb, present participle or gerund |
| VBN | Verb, past participle |
| VBP | Verb, present tense, non-3rd person singular |
| VBZ | Verb, present tense, 3rd person singular |
| WDT | Wh-determiner |
| WP | Wh-pronoun |
| WP$ | Possessive wh-pronoun |
| WRB | Wh-adverb |
| | Punctuation |
| # | Number |
| $ | Currency |
| `` | Quoted material |
| -LRB- | Left parenthesis |
| -RRB- | Right parenthesis |
| -LCB- | Left curly bracket |
| -RCB- | Right curly bracket |
| -LSB- | Left square bracket |
| -RSB- | Right square bracket |
| -LLB- | Left angle bracket |
| -RRB- | Right angle bracket |
| `` | Backtick |
| ~ | Tilde |

In addition to the tags listed above, the English Sentence Part of Speech Tagset also includes several special tags that are used to indicate the beginning and end of a sentence, as well as to mark errors or unknown words. These tags are as follows:| Tag | Description |
|---|---|
| | Sentence start |
| | Sentence end |
| | Unknown word |
| | Error |

The English Sentence Part of Speech Tagset is a valuable resource for NLP research and development. It provides a consistent and reliable way to annotate the parts of speech in English sentences, which can be used to improve the performance of NLP tasks such as parsing, named entity recognition, and machine translation.

2024-11-19


上一篇:CAD 快速且高效的箭头标注指南

下一篇:如何使用 AutoCAD 创建精确的斜线长度标注