二二八79週年掀「台灣史補課潮」，新生代如何與歷史對話？

2026年1月19日 · 杨勇 · 来源：tutorial资讯

Testing LLM reasoning abilities with SAT is not an original idea; there is a recent research that did a thorough testing with models such as GPT-4o and found that for hard enough problems, every model degrades to random guessing. But I couldn't find any research that used newer models like I used. It would be nice to see a more thorough testing done again with newer models.

Wide variety of templates to fit multiple uses

OpenAI sec ，详情可参考旺商聊官方下载

36氪获悉，2月26日收盘，美股三大指数涨跌不一，纳指跌1.18%，标普500指数跌0.54%，道指涨0.03%。大型科技股多数下跌，英伟达跌超5%，市值蒸发2592亿美元（约合人民币1.77万亿元）创去年4月16日以来最大单日跌幅；英特尔跌超3%，特斯拉跌超2%，谷歌、亚马逊跌超1%，苹果小幅下跌；奈飞涨超2%，微软、Meta小幅上涨。热门中概股普跌，百度跌超5%，哔哩哔哩、爱奇艺跌超3%，阿里巴巴、京东、理想汽车、小鹏汽车跌超2%，拼多多、蔚来跌超1%。

Anthropic