File:Improving automated lexical and discourse analysis of online chat dialog (IA improvingutomate109453281).pdf

Original file (1,275 × 1,650 pixels; file size: 600 KiB; MIME type: application/pdf; 128 pages)

This file and its description come from Wikimedia Commons.

Description

Author	Forsyth, Eric N.
Title	Improving automated lexical and discourse analysis of online chat dialog
Publisher	Monterey, California. Naval Postgraduate School
Description	One of the goals of natural language processing (NLP) systems is determining the meaning of what is being transmitted. Although much work has been accomplished in traditional written and spoken language domains, little has been performed in the newer computer-mediated communication domain enabled by the Internet, to include text-based chat. This is due in part to the fact that there are no annotated chat corpora available to the broader research community. The purpose of our research is to build a chat corpus, initially tagged with lexical and discourse information. Such a corpus could be used to develop stochastic NLP applications that perform tasks such as conversation thread topic detection, author profiling, entity identification, and social network analysis. During the course of our research, we preserved 477,835 chat posts and associated user profiles in an XML format for future investigation. We privacy-masked 10,567 of those posts and part-of-speech tagged a total of 45,068 tokens. Using the Penn Treebank and annotated chat data, we achieved part-of-speech tagging accuracy of 90.8%. We also annotated each of the privacy-masked corpus's 10,567 posts with a chat dialog act. Using a neural network with 23 input features, we achieved 83.2% dialog act classification accuracy. Subjects: Computer science
Language	English
Publication date	September 2007
Current location	IA Collections: navalpostgraduateschoollibrary; fedlink
Accession number	improvingutomate109453281
Source	Internet Archive identifier: improvingutomate109453281 https://archive.org/download/improvingutomate109453281/improvingutomate109453281.pdf
Permission (Reusing this file)	Approved for public release, distribution unlimited

Licensing

	This media file is in the public domain in the United States of America because its author is the U.S. federal government, as specified in Title 17, Chapter 1, Section 105 of the U.S. Code. Note: this applies only to works of the Federal Government, not to works of individual States or of any other geographic or political subdivision of the country.
This file has been identified as being free of known restrictions under copyright law, including all related and neighboring rights.

Creative Commons Public Domain Mark 1.0

File history

	Date and time	Thumbnail	Dimensions	User	Comment
current	22 July 2020, 02:44		1,275 × 1,650, 128 pages (600 KiB)	Fæ	FEDLINK - United States Federal Collection improvingutomate109453281 (User talk:Fæ/IA books#Fork8) (batch 1993-2020 #18602)

File usage

The following page uses this file:

Lexicalisation

Metadata

This file contains additional information, probably added by the digital camera or scanner used to create it.

If the file has been modified from its original state, some details may not fully reflect the modified file.

Short title	Improving automated lexical and discourse analysis of online chat dialog
Author	Forsyth, Eric N.
Software used	Forsyth, Eric N.
Conversion program	Acrobat Distiller 8.1.0 (Windows)
Encrypted	no
Page size	612 x 792 pts (letter)
PDF format version	1.4