Stanford CoreNLP
Stanford CoreNLP provides a set of human language technology tools. It can give the base forms of words and their parts of speech; recognize names of companies, people, and so on; normalize dates, times, and numeric quantities; mark up sentence structure in terms of phrases and syntactic dependencies; indicate which noun phrases refer to the same entities; indicate sentiment; extract particular or open-class relations between entity mentions; and get the quotes people said.
Install Plugin
deepnlpf --install stanfordcorenlp
Supported Languages
English: en
Pipeline
View all annotators.
Before assembling your pipeline, check which other annotators each annotator depends on.
{
    "lang": "en",
    "tools": {
        "stanfordcorenlp": {
            "processors": [
                "tokenize",
                "ssplit",
                "pos",
                "lemma",
                "ner",
                "truecase",
                "parse",
                "depparse",
                "coref"
            ]
        }
    }
}
lang: en
tools:
  - stanfordcorenlp:
      processors:
        - tokenize
        - ssplit
        - pos
        - lemma
        - ner
        - truecase
        - parse
        - depparse
        - coref
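The advice about dependent annotators can be checked mechanically before running anything. The sketch below is not part of DeepNLPF: `REQUIRES` and `check_order` are hypothetical names, and the prerequisite table is an assumption based on the annotator dependencies listed in the Stanford CoreNLP documentation (for example, some coref modes accept `depparse` instead of `parse`), so verify it against the CoreNLP version you use.

```python
# Assumed prerequisite table (hypothetical helper, not a DeepNLPF API).
# Check it against the CoreNLP annotator dependency list for your version.
REQUIRES = {
    "tokenize": [],
    "ssplit": ["tokenize"],
    "pos": ["tokenize", "ssplit"],
    "lemma": ["tokenize", "ssplit", "pos"],
    "ner": ["tokenize", "ssplit", "pos", "lemma"],
    "truecase": ["tokenize", "ssplit", "pos", "lemma"],
    "parse": ["tokenize", "ssplit"],
    "depparse": ["tokenize", "ssplit", "pos"],
    "coref": ["tokenize", "ssplit", "pos", "lemma", "ner", "parse"],
}


def check_order(processors):
    """Return a list of problems; an empty list means every
    prerequisite appears before the annotator that needs it."""
    problems = []
    seen = set()
    for name in processors:
        if name not in REQUIRES:
            problems.append(f"unknown annotator: {name}")
        else:
            for dep in REQUIRES[name]:
                if dep not in seen:
                    problems.append(f"{name} needs {dep} earlier in the list")
        seen.add(name)
    return problems


# The processor list from the pipeline definition above.
processors = [
    "tokenize", "ssplit", "pos", "lemma", "ner",
    "truecase", "parse", "depparse", "coref",
]
print(check_order(processors))
```

Under the assumed table, the list used in this section yields no problems, while a list that names `pos` before `tokenize` would be reported.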
Example
from deepnlpf.pipeline import Pipeline

# Directory containing the dataset to annotate.
path_dataset = "<path_dir_dataset>"
# Pipeline definition file (see the Pipeline section).
path_pipeline = "<path_file>/pipeline.json"

nlp = Pipeline(_input=path_dataset, pipeline=path_pipeline, _output='file')
annotation = nlp.annotate()
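If you prefer to generate the pipeline file rather than write it by hand, a minimal sketch using only the Python standard library might look like the following. `write_pipeline` is a hypothetical helper, not a DeepNLPF function, and the temporary-directory location is chosen only for illustration; the dictionary mirrors the JSON layout shown in the Pipeline section.

```python
import json
import tempfile
from pathlib import Path


def write_pipeline(path, processors, lang="en"):
    """Write a pipeline definition in the JSON layout shown in the
    Pipeline section and return the dictionary that was written."""
    config = {
        "lang": lang,
        "tools": {
            "stanfordcorenlp": {
                "processors": list(processors),
            }
        },
    }
    Path(path).write_text(json.dumps(config, indent=4), encoding="utf-8")
    return config


# Illustrative location only; use whatever directory suits your project.
out_file = Path(tempfile.gettempdir()) / "pipeline.json"
write_pipeline(out_file, ["tokenize", "ssplit", "pos", "lemma", "ner"])
print(out_file.read_text(encoding="utf-8"))
```

The resulting file path is the kind of value passed as `pipeline=` in the example above.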