Close
Help
Need Help?





JOURNAL

Biomedical Informatics Insights

120,195 Journal Article Views | Journal Analytics

Suicide Note Sentiment Classification: A Supervised Approach Augmented by Web Data

Submit a Paper



Publication Date: 30 Jan 2012

Type: Original Research

Journal: Biomedical Informatics Insights

Citation: Biomedical Informatics Insights 2012:5 (Suppl. 1) 31-41

doi: 10.4137/BII.S8956

Abstract

Objective: To create a sentiment classification system for the Fifth i2b2/VA Challenge Track 2, which can identify thirteen subjective categories and two objective categories.

Design: We developed a hybrid system using Support Vector Machine (SVM) classifiers with augmented training data from the Internet. Our system consists of three types of classification-based systems: the first system uses spanning n-gram features for subjective categories, the second one uses bag-of-n-gram features for objective categories, and the third one uses pattern matching for infrequent or subtle emotion categories. The spanning n-gram features are selected by a feature selection algorithm that leverages emotional corpus from weblogs. Special normalization of objective sentences is generalized with shallow parsing and external web knowledge. We utilize three sources of web data: the weblog of LiveJournal which helps to improve the feature selection, the eBay List which assists in special normalization of information and instructions categories, and the suicide project web which provides unlabeled data with similar properties as suicide notes.

Measurements: The performance is evaluated by the overall micro-averaged precision, recall and F-measure.

Result: Our system achieved an overall micro-averaged F-measure of 0.59. Happiness_peacefulness had the highest F-measure of 0.81. We were ranked as the second best out of 26 competing teams.

Conclusion: Our results indicated that classifying fine-grained sentiments at sentence level is a non-trivial task. It is effective to divide categories into different groups according to their semantic properties. In addition, our system performance benefits from external knowledge extracted from publically available web data of other purposes; performance can be further enhanced when more training data is available.


Downloads

PDF  (595.17 KB PDF FORMAT)

RIS citation   (ENDNOTE, REFERENCE MANAGER, PROCITE, REFWORKS)

BibTex citation   (BIBDESK, LATEX)

XML




Our Service Promise

  • Prompt Processing (3 Weeks to Editorial Decision)
  • Fair, Independent Peer Review
  • High Visibility & Extensive Indexing
What Your Colleagues Say About Biomedical Informatics Insights
testimonial_image
It's a great experience publishing with Biomedical Informatics Insights. I am particularly impressed with the in-depth and constructive comments provided by the reviewers within such a short time-frame. The typesetting was not only prompt, but most importantly, effective. In fact, this was among the very few publication experiences that I have had when no correction was needed in the author proofs. I highly recommend Biomedical Informatics Insights to both readers and prospective ...
Dr Chun Hsi Huang (Computer Science and Engineering, University of Connecticut)
More Testimonials

Quick Links




Follow Us We make it easy to find new research papers.




SUBJECT HUBS
Author Survey Results
author_survey_results
All authors are surveyed after their articles are published. Authors are asked to rate their experience in a variety of areas, and their responses help us to monitor our performance. Presented here are their responses in some key areas. No 'poor' or 'very poor' responses were received; these are represented in the 'other' category.
See Our Results