13 Commits

Author SHA1 Message Date
Gea-Suan Lin
d72fe86325 Use id instead of article.Id. 2024-02-28 21:54:15 +08:00
Gea-Suan Lin
53d0b162d1 Avoid from getting article number repeatly. 2024-02-28 06:06:38 +08:00
Gea-Suan Lin
2d1c6f161a Check arguments. 2024-02-18 22:00:43 +08:00
Gea-Suan Lin
64a2507631 Add link so that I can validate quickly. 2024-02-18 21:45:29 +08:00
Gea-Suan Lin
79b23c32f2 Show article id only on score > 0. 2024-02-12 05:21:17 +08:00
Gea-Suan Lin
1e5b1dcf9a Use os.Args[1] as query string, also lowercase all the time. 2024-02-11 14:32:09 +08:00
Gea-Suan Lin
d6e1c1dbf5 Implement query to tf-idf score. 2024-02-09 15:16:47 +08:00
Gea-Suan Lin
2d447ad45b Change TF from [id][term] to [term][id]. 2024-02-09 14:33:07 +08:00
Gea-Suan Lin
ade2049093 Implement TF & DF in tf-idf. 2024-02-09 14:20:08 +08:00
Gea-Suan Lin
18fbfa7292 Rename tokenize to tokenizer. 2024-02-09 11:47:13 +08:00
Gea-Suan Lin
ce79d2b245 Implement tokenize(). 2024-02-09 11:46:19 +08:00
Gea-Suan Lin
28c1df566d Implement the first part of tfidf. 2024-01-31 09:43:30 +08:00
Gea-Suan Lin
6ee597dc7f Add a skeleton of ir-tfidf and its related settings. 2024-01-29 00:49:19 +08:00