Abstract
The naive Bayes classifier is a popular classifier, as it is easy to train, requires no cross-validation for parameter tuning, and can be easily extended owing to its generative model. Moreover, it was recently shown that word probabilities estimated from large unlabeled corpora can be used to improve the parameter estimation of naive Bayes. However, previous methods do not provide explicit control over how much the background distribution influences the estimation of the naive Bayes parameters. In contrast, we investigate an extension of the graphical model of naive Bayes in which each word is generated either from a background distribution or from a class-specific word distribution. We analyze this model theoretically and show its connection to Jelinek-Mercer smoothing. Experiments on four standard text classification data sets show that the proposed method achieves statistically significant improvements over previous methods that use the same background distribution.
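As a brief sketch of the claimed connection (using a mixing weight $\lambda$ and distribution names that are our own notation, not necessarily the paper's), suppose each word token is drawn from the background distribution with probability $\lambda$ and from the class-conditional distribution otherwise. Marginalizing out this latent choice yields the familiar Jelinek-Mercer interpolation:
\begin{equation*}
  P(w \mid c) \;=\; (1-\lambda)\, P_{\mathrm{class}}(w \mid c) \;+\; \lambda\, P_{\mathrm{bg}}(w),
\end{equation*}
where $P_{\mathrm{class}}(w \mid c)$ is the class-specific word distribution and $P_{\mathrm{bg}}(w)$ is the background distribution estimated from the unlabeled corpus.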