4323机器学习中等essayshort
Long Offline Cross-Reference
题目
You are building an offline model over 4000-token documents where answers often depend on matching phrases across distant sections. Latency is less important than capturing those long-range interactions. Which architecture should dominate the shortlist?
解题计时
0:00
提交作答时记录,用于后续平均用时统计。
你的答案