How to Deal with Large Dataset, Class Imbalance and Binary Output in SVM based 
Response Model

Shin, H; Cho, S

Datensatz

DATENSATZ AKTIONENEXPORT

Zur Ablage hinzufügen

Lokale TagsFreigabegeschichteDetailsÜbersicht

Freigegeben

Konferenzbeitrag

How to Deal with Large Dataset, Class Imbalance and Binary Output in SVM based Response Model

MPG-Autoren

Es sind keine MPG-Autoren in der Publikation vorhanden

Externe Ressourcen

https://pdfs.semanticscholar.org/c6b0/789f3fe88ca55a8b7d543227eb20d9d11795.pdf
(beliebiger Volltext)

Volltexte (beschränkter Zugriff)

Für Ihren IP-Bereich sind aktuell keine Volltexte freigegeben.

Volltexte (frei zugänglich)

Es sind keine frei zugänglichen Volltexte in PuRe verfügbar

Ergänzendes Material (frei zugänglich)

Es sind keine frei zugänglichen Ergänzenden Materialien verfügbar

Zitation

Shin, H., & Cho, S. (2003). How to Deal with Large Dataset, Class Imbalance and Binary Output in SVM based Response Model. In Korean Data Mining Conference 2003 (pp. 93-107).

Zitierlink: https://hdl.handle.net/11858/00-001M-0000-0013-DAA3-4

Zusammenfassung

[Abstract]: Various machine learning methods have made a rapid transition to response modeling in search of improved
performance. And support vector machine (SVM) has also been attracting much attention lately. This paper presents an
SVM response model. We are specifically focusing on the how-tos to circumvent practical obstacles, such as how to
face with class imbalance problem, how to produce the scores from an SVM classifier for lift chart analysis, and how
to evaluate the models on accuracy and profit. Besides coping with the intractability problem of SVM training caused
by large marketing dataset, a previously proposed pattern selection algorithm is introduced. SVM training accompanies
time complexity of the cube of training set size. The pattern selection algorithm picks up important training patterns
before SVM response modeling. We made comparison on SVM training results between the pattern selection algorithm and random sampling. Three aspects of SVM response models were evaluated: accuracies, lift chart analysis, and computational efficiency. The SVM trained with selected patterns showed a high accuracy, a high uplift in profit and
in response rate, and a high computational efficiency.