java.io.Serializable
, java.util.Enumeration<java.lang.String>
, OptionHandler
, RevisionHandler
public class TweetNLPTokenizer extends Tokenizer
@InProceedings{twitterNLP, Title = {Part-of-speech tagging for twitter: Annotation, features, and experiments}, Author = {Gimpel, Kevin and Schneider, Nathan and O'Connor, Brendan and Das, Dipanjan and Mills, Daniel and Eisenstein, Jacob and Heilman, Michael and Yogatama, Dani and Flanigan, Jeffrey and Smith, Noah A}, Booktitle = {Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers-Volume 2}, Year = {2011}, Organization = {Association for Computational Linguistics}, Pages = {42--47} }
Constructor | Description |
---|---|
TweetNLPTokenizer() |
Modifier and Type | Method | Description |
---|---|---|
java.lang.String |
getRevision() |
Returns the revision string.
|
TechnicalInformation |
getTechnicalInformation() |
Returns an instance of a TechnicalInformation object, containing
detailed information about the technical background of this class,
e.g., paper reference or book this class is based on.
|
java.lang.String |
globalInfo() |
Returns a string describing this tokenizer.
|
boolean |
hasMoreElements() |
Tests if this enumeration contains more elements.
|
static void |
main(java.lang.String[] args) |
Runs the tokenizer with the given options and strings to tokenize.
|
java.lang.String |
nextElement() |
Returns the next element of this enumeration if this enumeration object has
at least one more element to provide.
|
void |
tokenize(java.lang.String s) |
Sets the string to tokenize.
|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
getOptions, listOptions, runTokenizer, setOptions, tokenize
public java.lang.String globalInfo()
globalInfo
in class Tokenizer
public TechnicalInformation getTechnicalInformation()
public boolean hasMoreElements()
hasMoreElements
in interface java.util.Enumeration<java.lang.String>
hasMoreElements
in class Tokenizer
public java.lang.String nextElement()
nextElement
in interface java.util.Enumeration<java.lang.String>
nextElement
in class Tokenizer
public void tokenize(java.lang.String s)
public java.lang.String getRevision()
public static void main(java.lang.String[] args)
args
- the commandline options and strings to tokenize