Skip to content

Parameter tuning for topic model creation #5

@munterkalmsteiner

Description

@munterkalmsteiner

Experiment whether this makes sense: the optimization target is the paper coverage of the relevant topics, i.e. all papers should be associated with a relevant topic. Garbage topics are the outlier topic (-1) and the ones selected by the user, typically topics generated from copyright notices.

What we would need to do is to train the topic model with some guestimate parameters so we can identify garbage topics. Then we start optimization and evaluate coverage.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions