ユーザ行動分析のためのセッション・トークナイザーの... - 岡本一志 - 日本学术振兴会基金(JS...

ユーザ行動分析のためのセッション・トークナイザーの開発

项目来源

日本学术振兴会基金(JSPS)

项目主持人

岡本一志

项目受资助机构

電気通信大学

项目编号

24K21410

立项年度

2024

立项时间

未公开

项目级别

国家级

研究期限

未知 / 未知

受资助金额

6240000.00日元

学科

経済学、経営学およびその関連分野

学科代码

未公开

基金类别

挑戦的研究(萌芽)

关键词

行動系列 ; トークナイザー ; 機械学習 ; 推薦システム ; 時系列データ ; 行動ログ ; 行動分析 ; 情報推薦 ;

参与者

未公开

参与机构

電気通信大学，大学院情報理工学研究科

项目标书摘要：当初計画していた点予測の技術について検討を実施でき,予定通り研究を進めることができたため.さらに,今後の研究発展につながる要素技術についても検討を進められている.本研究は,デジタルプラットフォーム上で観測された行動系列(セッションデータ)の分割点を自動検出し,一定の行動のまとまり(トークン)に分割するセッション・トークナイザーの開発を目指すものである.本年度は,点予測法の適用に取り組んだ.点予測法は,行動間の分割点の有無を2クラス分類問題として予測する手法である.具体的な取り組みとしては,行動系列中の各行動を(1)Item2Vecによる分散表現,(2)大規模言語モデル(LLM)による商品名とブランドの分散表現,(3)商品価格の3種類の特徴量の結合により記述し,予測対象の分割点の前後数行動間の特徴量の類似度を2クラス分類器(トークナイザー)に入力し,予測を行うアプローチを検証している.本研究課題の開始以前に作成していたベンチマークデータセットを用いた評価実験を実施し,有望な2クラス分類器と類似度を算出すべき前後の行動数(ウィンドウサイズ)の特定を試みている.評価実験では,2クラス分類器としてLightGBM,XGBoost,CatBoost,Support Vector Machine,線形回帰の6種類の教師あり学習手法を,ウィンドウサイズには1から4をそれぞれ採用している.実験結果より,LightGBMとウィンドウサイズ2の組み合わせで最良の分割精度(F1-score 0.81)が得られることを確認している.現在,これらの成果をまとめた論文を準備中であり,2025年度中に学術論文誌に投稿することを計画している.本年度に用いたベンチマークデータセットは,人手によるアノテーションで作成したものであるが,今後は,人が行動の分割点と判断する点をLLMはどのように判断するかを評価し,zero-shotやfew-shot学習のアプローチでの分割点の予測にも取り組んでいきたい.本年度の研究成果を学術論文としてまとめて公表していくことに加え,LLMの活用についても検討を進める.Reason:当初計画していた点予測の技術について検討を実施でき,予定通り研究を進めることができたため.さらに,今後の研究発展につながる要素技術についても検討を進められている。Outline of Research at the Start:セッションデータは,デジタルプラットフォーム上の商品やサービスの購入や利用ログとして大量に蓄積されており,その分析・活用は,マーケティングやサービスの改善などに欠かせない.セッションデータ分析では行動系列を一定の行動のまとまり(トークン)に分割するが,分析作業の初手で使えるセッションデータ用分割技術は分析の効率化・高度化に貢献するものである.本研究では,形態素解析技術を参考に,セッションデータ中の分割点を自動検出し,トークンに分割するセッション・トークナイザーを開発する.この開発を通じ,限られた分析者・少ない作業負担の制約下で,セッションデータ分割を効率的に実現する方法論の解明に取り組む。

排序方式：时间相关性
显示方式：列表摘要

1.Function-based labels for complementary recommendation: Definition, annotation, and LLM-as-a-Judge

关键词：
Behavioral research;Classification (of information);Learning systems;Machine learning;Purchasing;Annotation;Binary classification;Complementary recommendation;Complementary relationship;Decision process;Human perception;Language model;Large language model;Machine learning methods;Users' experiences

Yamasaki, Chihiro;Sugahara, Kai;Nagi, Yuma;Okamoto, Kazushi
《Pattern Recognition Letters》
2026年
200卷
期
期刊

Complementary recommendations enhance the user experience by suggesting items that are frequently purchased together while serving different functions from the query item. Inferring or evaluating whether two items have a complementary relationship requires complementary relationship labels; however, defining these labels is challenging because of the inherent ambiguity of such relationships. Complementary labels based on user historical behavior logs attempt to capture these relationships, but often produce inconsistent and unreliable results. Recent efforts have introduced large language models (LLMs) to infer these relationships. However, these approaches provide a binary classification without a nuanced understanding of complementary relationships. In this study, we address these challenges by introducing Function-Based Labels (FBLs), a novel definition of complementary relationships independent of user purchase logs and the opaque decision processes of LLMs. We constructed a human-annotated FBLs dataset comprising 2759 item pairs and demonstrated that it covered possible item relationships and minimized ambiguity. We then evaluated whether machine learning methods using annotated FBLs could accurately infer labels for unseen item pairs, and whether LLM-generated complementary labels align with human perception. Among machine learning methods, ModernBERT achieved the highest performance with a Macro-F1 of 0.911, demonstrating accuracy and robustness even under limited supervision. For LLMs, GPT-4o-mini achieved high consistency (0.989) and classification accuracy (0.849) under the detailed FBL definition, while requiring only 1/842 the cost and 1/75 the time of human annotation. Overall, our study presents FBLs as a clear definition of complementary relationships, enabling more accurate inferences and automated labeling of complementary recommendations. © 2025 Elsevier B.V.

...

2.A Universal Framework for Offline Serendipity Evaluation in Recommender Systems via Large Language Models

关键词：
Knowledge management;Evaluation;Evaluation framework;Ground truth;Language model;Large language model;Offline;Performance;Serendipity;Unobservable;Users' satisfactions

Tokutake, Yu;Okamoto, Kazushi;Harada, Kei;Shibata, Atsushi;Karube, Koki
《34th ACM International Conference on Information and Knowledge Management, CIKM 2025》
2025年
November 10, 2025 - November 14, 2025
Seoul, Korea, Republic of
会议

Serendipity in recommender systems (RSs) has attracted increasing attention as a concept that enhances user satisfaction by presenting unexpected and useful items. However, evaluating serendipitous performance remains challenging because its ground truth is generally unobservable. The existing offline metrics often depend on ambiguous definitions or are tailored to specific datasets and RSs, thereby limiting their generalizability. To address this issue, we propose a universally applicable evaluation framework that leverages large language models (LLMs) known for their extensive knowledge and reasoning capabilities, as evaluators. First, to improve the evaluation performance of the proposed framework, we assessed the serendipity prediction accuracy of LLMs using four different prompt strategies on a dataset containing user-annotated serendipitous ground truth and found that the chain-of-thought prompt achieved the highest accuracy. Next, we re-evaluated the serendipitous performance of both serendipity-oriented and general RSs using the proposed framework on three commonly used real-world datasets, without the ground truth. The results indicated that there was no serendipity-oriented RS that consistently outperformed across all datasets, and even a general RS sometimes achieved higher performance than the serendipity-oriented RS. © 2025 Copyright held by the owner/author(s).

...

排序方式：时间相关性
显示方式：列表摘要