How to Deal with Large Dataset, Class Imbalance and Binary Output in SVM based 
Response Model

Shin, H

Datensatz

DATENSATZ AKTIONENEXPORT

DownloadE-Mail

Bitte beachten Sie, dass eine neuere Version dieses Datensatzes verfügbar ist:
https://pure.mpg.de/pubman/item/item_1792211_3

DetailsÜbersicht

How to Deal with Large Dataset, Class Imbalance and Binary Output in SVM based Response Model

Shin, H. (2003). How to Deal with Large Dataset, Class Imbalance and Binary Output in SVM based Response Model. In Korean Data Mining Conference (pp. 93-107).

Item is Freigegeben

einblenden: alle ausblenden: alle

Basisdaten

einblenden: ausblenden:

Datensatz-Permalink: https://hdl.handle.net/11858/00-001M-0000-0013-DAA3-4 Versions-Permalink: https://hdl.handle.net/11858/00-001M-0000-0013-DAA4-2

Genre: Konferenzbeitrag

ausblenden:

Urheber:
Shin, H¹, Autor

Affiliations:
1Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497795

Inhalt

einblenden:

ausblenden:

Schlagwörter: -

Zusammenfassung: [Abstract]: Various machine learning methods have made a rapid transition to response modeling in search of improved performance. And support vector machine (SVM) has also been attracting much attention lately. This paper presents an SVM response model. We are specifically focusing on the how-tos to circumvent practical obstacles, such as how to face with class imbalance problem, how to produce the scores from an SVM classifier for lift chart analysis, and how to evaluate the models on accuracy and profit. Besides coping with the intractability problem of SVM training caused by large marketing dataset, a previously proposed pattern selection algorithm is introduced. SVM training accompanies time complexity of the cube of training set size. The pattern selection algorithm picks up important training patterns before SVM response modeling. We made comparison on SVM training results between the pattern selection algorithm and random sampling. Three aspects of SVM response models were evaluated: accuracies, lift chart analysis, and computational efficiency. The SVM trained with selected patterns showed a high accuracy, a high uplift in profit and in response rate, and a high computational efficiency.

Details

einblenden:

ausblenden:

Sprache(n):

Datum: Erschienen: 2003-12

Publikationsstatus: Erschienen

Seiten: -

Ort, Verlag, Ausgabe: -

Inhaltsverzeichnis: -

Art der Begutachtung: -

Identifikatoren: BibTex Citekey: 2709

Art des Abschluß: -

Veranstaltung

einblenden:

ausblenden:

Titel: Korean Data Mining Conference

Veranstaltungsort: Seoul, Korea

Start-/Enddatum: -

Entscheidung

einblenden:

Projektinformation

einblenden:

Quelle 1

einblenden:

ausblenden:

Titel: Korean Data Mining Conference

Genre der Quelle: Konferenzband

Urheber:

Affiliations:

Ort, Verlag, Ausgabe: -

Seiten: - Band / Heft: - Artikelnummer: - Start- / Endseite: 93 - 107 Identifikator: -

Datensatz

Basisdaten

Dateien

Externe Referenzen

Urheber

Inhalt

Details

Veranstaltung

Entscheidung

Projektinformation

Quelle 1