skillNer.utils.Utils

class skillNer.utils.Utils(nlp, skills_db)
__init__(nlp, skills_db)

Methods

__init__(nlp, skills_db)

compute_w_ratio(skill_id, matched_tokens)

get_clusters(co_oc)

get_corpus(text, matches)

create a corpus matrix which will be used in future computations.

grouper(iterable, dist)

make_one(cluster, len_)

one_gram_sim(text_str, skill_str)

process_n_gram(matches, text_obj)

apply on conflicted matches to choose which ones to keep

retain(text_obj, span, skill_id, sk_look, corpus)

add doc here

split_at_values(lst, val)