Skip to content
@kuhumcst

Centre for Language Technology, University of Copenhagen

Popular repositories Loading

  1. cstlemma cstlemma Public

    Lemmatiser for Danish, Dutch, English, German, Polish, Romanian, Russian and tens of other languages, that uses affix rules (affix: prefix, infix, suffix, circumfix). Rules are obtained by supervis…

    C++ 36 6

  2. stucco stucco Public archive

    An experimental adaptive UI toolkit.

    Clojure 31 1

  3. DanNet DanNet Public

    The Danish WordNet as an RDF graph.

    Clojure 21

  4. xml-hiccup xml-hiccup Public

    Convert XML into Hiccup in Clojure and ClojureScript.

    Clojure 21 1

  5. taggerXML taggerXML Public

    Modernized version of Eric Brill's Part Of Speech tagger.

    C++ 17 6

  6. tf-idf tf-idf Public

    A reasonably performant TF-IDF implementation.

    Clojure 12 1

Repositories

Showing 10 of 64 repositories
  • kuhumcst/danish-semantic-reasoning-benchmark’s past year of commit activity
    1 0 0 0 Updated Apr 2, 2025
  • cstlemma Public

    Lemmatiser for Danish, Dutch, English, German, Polish, Romanian, Russian and tens of other languages, that uses affix rules (affix: prefix, infix, suffix, circumfix). Rules are obtained by supervised learning from a full form - lemma list.

    kuhumcst/cstlemma’s past year of commit activity
    C++ 36 GPL-2.0 6 1 0 Updated Apr 1, 2025
  • affixtrain Public

    Using supervised learning, create a set of affix rules for use by the CSTlemma lemmatiser.

    kuhumcst/affixtrain’s past year of commit activity
    C++ 4 GPL-2.0 0 0 0 Updated Mar 31, 2025
  • texton Public

    Text Tonsorium - a toolbox that automatically arranges NLP tools in workflows and enacts them with user's inputs

    kuhumcst/texton’s past year of commit activity
    PHP 4 0 1 0 Updated Mar 31, 2025
  • gml Public

    Create training sets for tagger and lemmatiser for Middle Low German.

    kuhumcst/gml’s past year of commit activity
    0 GPL-3.0 0 0 0 Updated Mar 25, 2025
  • texton-Java Public

    Web-based workflow management system that computes candidate tool workflows given input file(s) and the user's requirements regarding the output. Afterwards, runs a workflow selected by the user from the list of candidates. Implemented in Bracmat (~75%) and Java (~25%).

    kuhumcst/texton-Java’s past year of commit activity
    Java 2 GPL-3.0 2 0 0 Updated Mar 19, 2025
  • clarin-tei Public Forked from kuhumcst/glossematics
    kuhumcst/clarin-tei’s past year of commit activity
    Clojure 0 1 3 0 Updated Feb 25, 2025
  • hiccup-tools Public

    Navigate and manipulate Hiccup documents.

    kuhumcst/hiccup-tools’s past year of commit activity
    Clojure 1 0 1 0 Updated Jan 29, 2025
  • DanNet Public

    The Danish WordNet as an RDF graph.

    kuhumcst/DanNet’s past year of commit activity
    Clojure 21 MIT 0 34 0 Updated Jan 7, 2025
  • kuhumcst/clarin-landing-page’s past year of commit activity
    HTML 0 0 0 0 Updated Dec 18, 2024

Top languages

Loading…

Most used topics

Loading…