Advanced
Please use this identifier to cite or link to this item: https://digital.lib.ueh.edu.vn/handle/UEH/68745
Full metadata record
DC FieldValueLanguage
dc.contributor.authorXiming Li-
dc.contributor.otherBing Wang-
dc.contributor.otherYue Wang-
dc.contributor.otherJihong Ouyang-
dc.contributor.otherHarish Garg-
dc.contributor.otherDang N. H. Thanh-
dc.date.accessioned2023-05-30T02:27:28Z-
dc.date.available2023-05-30T02:27:28Z-
dc.date.issued2023-
dc.identifier.issn1432-7643 (Print), 1433-7479 (Online)-
dc.identifier.urihttps://digital.lib.ueh.edu.vn/handle/UEH/68745-
dc.description.abstractDataless text classification, i.e., a new paradigm of weakly supervised learning, refers to the task of learning with unlabeled documents and a few predefined representative words of categories, known as seed words. The recent generative dataless methods construct document-specific category priors by using seed word occurrences only; however, such category priors often contain very limited and even noisy supervised signals. To remedy this problem, in this paper, we propose a novel formulation of category prior. First, for each document, we consider its label membership degree by not only counting seed word occurrences, but also using a novel prototype scheme, which captures pseudo-nearest neighboring categories. Second, for each label, we consider its frequency prior knowledge of the corpus, which is also a discriminative knowledge for classification. By incorporating the proposed category prior into the previous generative dataless method, we suggest a novel generative dataless method, namely Weakly Supervised Prototype Topic Model. The experimental results on real-world datasets demonstrate that WSPTM outperforms the existing baseline methods.en
dc.formatPortable Document Format (PDF)-
dc.languageeng-
dc.publisherSpringer-
dc.relation.ispartofSoft Computing-
dc.relation.ispartofseriesVol. 27-
dc.rightsSpringer Nature Switzerland AG.vi
dc.subjectDataless text classification-
dc.subjectTopic modeling-
dc.subjectSeed words-
dc.subjectCategory prior-
dc.subjectPrototype scheme-
dc.titleWeakly supervised prototype topic model with discriminative seed words: modifying the category prior by self-exploring supervised signals-
dc.typeJournal Article-
dc.identifier.doihttps://doi.org/10.1007/s00500-022-07771-9-
ueh.JournalRankingISI, Scopus-
item.cerifentitytypePublications-
item.grantfulltextnone-
item.openairetypeJournal Article-
item.fulltextOnly abstracts-
item.openairecristypehttp://purl.org/coar/resource_type/c_18cf-
Appears in Collections:INTERNATIONAL PUBLICATIONS
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.