Articles
305
Tags
24
Categories
15

Home
Content
  • Paper
  • LLMs
  • Jupyter
  • Note
  • Algorithm
  • PLs
Daily
  • Github
  • Weibo
  • HF
  • Arxiv
Archives
Categories
About
37.2° Blog
Weibo Hot
Created 2019-06-18 | weibo
202508: weibo 2025-08-29, weibo 2025-08-28, weibo 2025-08-27, weibo 2025-08-26, weibo 2025-08-25, weibo 2025-08-24, weibo 2025-08-23, weibo 2025-08-22, weibo 2025-08-21, weibo 2025-08-20, weibo 2025-08-19, weibo 2025-08-18, weibo 2025-08-17, weibo 2025-08-16, weibo 2025-08-15, weibo 2025-08-14, weibo 2025-08-13, weibo 2025-08-12, weibo 2025-08-11, weibo 2025-08-10, weibo 2025-08-09, weibo 2025-08-08, weibo 2025-08-07, weibo 2025-08-06, weibo 2025-08-05, weibo 2025-08-04, weibo 2025-08-03, weibo 2025-08-02, weibo 2025-08-01 ...
ArXiv Domain 2025-07-14
Created 2019-06-18 | AI
Data source: ArXiv Domain
LLM Domain Papers
1. One Token to Fool LLM-as-a-Judge
Generative reward models (also known as LLMs-as-judges), which use large language models (LLMs) to evaluate answer quality, are increasingly adopted in reinforcement learning with verifiable rewards (RLVR). They are often preferred over rigid rule-based metrics, especially for complex reasoning tasks involving free-form outputs. In this paradigm, an LLM is typically prompted to compare a candidate answer against a ground-tru ...
ArXiv Domain 2025-07-15
Created 2019-06-18 | AI
Data source: ArXiv Domain
LLM Domain Papers
1. CodeJudgeBench: Benchmarking LLM-as-a-Judge for Coding Tasks
Large Language Models (LLMs) have significantly advanced the state-of-the-art in various coding tasks. Beyond directly answering user queries, LLMs can also serve as judges, assessing and comparing the quality of responses generated by other models. Such an evaluation capability is crucial both for benchmarking different LLMs and for improving response quality through response ranking. However, de ...
ArXiv Domain 2025-07-16
Created 2019-06-18 | AI
Data source: ArXiv Domain
LLM Domain Papers
1. CodeJudgeBench: Benchmarking LLM-as-a-Judge for Coding Tasks
Large Language Models (LLMs) have significantly advanced the state-of-the-art in various coding tasks. Beyond directly answering user queries, LLMs can also serve as judges, assessing and comparing the quality of responses generated by other models. Such an evaluation capability is crucial both for benchmarking different LLMs and for improving response quality through response ranking. However, de ...
ArXiv Domain 2025-07-17
Created 2019-06-18 | AI
Data source: ArXiv Domain
LLM Domain Papers
1. Web-Browsing LLMs Can Access Social Media Profiles and Infer User Demographics
Large language models (LLMs) have traditionally relied on static training data, limiting their knowledge to fixed snapshots. Recent advancements, however, have equipped LLMs with web browsing capabilities, enabling real-time information retrieval and multi-step reasoning over live web content. While prior studies have demonstrated LLMs' ability to access and analyze websites, thei ...
ArXiv Domain 2025-07-18
Created 2019-06-18 | AI
Data source: ArXiv Domain
LLM Domain Papers
1. Comparing Apples to Oranges: A Dataset & Analysis of LLM Humour Understanding from Traditional Puns to Topical Jokes
Humour, as a complex language form, is derived from myriad aspects of life, whilst existing work on computational humour has focussed almost exclusively on short pun-based jokes. In this work, we investigate whether the ability of Large Language Models (LLMs) to explain humour depends on the particular humour form. We compare models on si ...
GitHub Trending 2025-06-27
Created 2019-06-18 | GitHub
Data source: gtrend.yapie.me
  • twentyhq/twenty: Building a modern alternative to Salesforce, powered by the community. ⭐ Stars: 29974 🍴 Forks: 3456 📝 Language: TypeScript
  • black-forest-labs/flux: Official inference repo for FLUX.1 models ⭐ Stars: 22872 🍴 Forks: 1627 📝 Language: Python
  • GraphiteEditor/Graphite: 2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow. ⭐ Stars: 13645 🍴 Forks: 633 📝 Language: Rust
  • adityachand ...
GitHub Trending 2025-06-28
Created 2019-06-18 | GitHub
Data source: gtrend.yapie.me
  • twentyhq/twenty: Building a modern alternative to Salesforce, powered by the community. ⭐ Stars: 29974 🍴 Forks: 3456 📝 Language: TypeScript
  • black-forest-labs/flux: Official inference repo for FLUX.1 models ⭐ Stars: 22872 🍴 Forks: 1627 📝 Language: Python
  • GraphiteEditor/Graphite: 2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow. ⭐ Stars: 13645 🍴 Forks: 633 📝 Language: Rust
  • adityachand ...
GitHub Trending 2025-06-30
Created 2019-06-18 | GitHub
Data source: gtrend.yapie.me
  • GraphiteEditor/Graphite: 2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow. ⭐ Stars: 15119 🍴 Forks: 680 📝 Language: Rust
  • twentyhq/twenty: Building a modern alternative to Salesforce, powered by the community. ⭐ Stars: 32034 🍴 Forks: 3571 📝 Language: TypeScript
  • nextcloud/all-in-one: 📦 The official Nextcloud installation method. Provides easy deployment and maintenance with most features ...
GitHub Trending 2025-07-02
Created 2019-06-18 | GitHub
Data source: github.com/trending
Any Languages
  • microsoft/generative-ai-for-beginners: 21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/ ⭐ Stars: 89399 🍴 Forks: 📝 Language: Jupyter Notebook
  • NanmiCoder/MediaCrawler: Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, Weibo posts and comments, Baidu Tieba posts and comment replies, and Zhihu Q&A articles and comments ⭐ Stars: 24782 🍴 Forks: 📝 Language: Python
  • zaidmukaddam/scira: Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engi ...
GitHub Trending 2025-07-03
Created 2019-06-18 | GitHub
Data source: github.com/trending
Any Languages
  • microsoft/generative-ai-for-beginners: 21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/ ⭐ Stars: 90135 🍴 Forks: 📝 Language: Jupyter Notebook
  • NanmiCoder/MediaCrawler: Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, Weibo posts and comments, Baidu Tieba posts and comment replies, and Zhihu Q&A articles and comments ⭐ Stars: 24986 🍴 Forks: 📝 Language: Python
  • zaidmukaddam/scira: Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engi ...
GitHub Trending 2025-07-04
Created 2019-06-18 | GitHub
Data source: github.com/trending
Any Languages
  • NanmiCoder/MediaCrawler: Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, Weibo posts and comments, Baidu Tieba posts and comment replies, and Zhihu Q&A articles and comments ⭐ Stars: 26240 🍴 Forks: 📝 Language: Python
  • Genesis-Embodied-AI/Genesis: A generative world for general-purpose robotics & embodied AI learning. ⭐ Stars: 25622 🍴 Forks: 📝 Language: Python
  • LadybirdBrowser/ladybird: Truly independent web browser ⭐ Stars: 44654 🍴 Forks: 📝 Language: C++
  • swagger-api/swagger-ui: Swagger UI is a colle ...
GitHub Trending 2025-07-05
Created 2019-06-18 | GitHub
Data source: github.com/trending
Any Languages
  • NanmiCoder/MediaCrawler: Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, Weibo posts and comments, Baidu Tieba posts and comment replies, and Zhihu Q&A articles and comments ⭐ Stars: 26460 🍴 Forks: 📝 Language: Python
  • Genesis-Embodied-AI/Genesis: A generative world for general-purpose robotics & embodied AI learning. ⭐ Stars: 25663 🍴 Forks: 📝 Language: Python
  • LadybirdBrowser/ladybird: Truly independent web browser ⭐ Stars: 44713 🍴 Forks: 📝 Language: C++
  • swagger-api/swagger-ui: Swagger UI is a colle ...
GitHub Trending 2025-07-06
Created 2019-06-18 | GitHub
Data source: github.com/trending
Any Languages
  • NanmiCoder/MediaCrawler: Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, Weibo posts and comments, Baidu Tieba posts and comment replies, and Zhihu Q&A articles and comments ⭐ Stars: 27284 🍴 Forks: 📝 Language: Python
  • rustfs/rustfs: 🚀 High-performance distributed object storage for MinIO alternative. ⭐ Stars: 1076 🍴 Forks: 📝 Language: Rust
  • LadybirdBrowser/ladybird: Truly independent web browser ⭐ Stars: 44863 🍴 Forks: 📝 Language: C++
  • datawhalechina/happy-llm: 📚 A from-scratch tutorial on the principles and practice of large language models ⭐ Stars: 8271 🍴 For ...
GitHub Trending 2025-07-07
Created 2019-06-18 | GitHub
Data source: github.com/trending
Any Languages
  • NanmiCoder/MediaCrawler: Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, Weibo posts and comments, Baidu Tieba posts and comment replies, and Zhihu Q&A articles and comments ⭐ Stars: 27720 🍴 Forks: 📝 Language: Python
  • dockur/macos: macOS inside a Docker container. ⭐ Stars: 14372 🍴 Forks: 📝 Language: Shell
  • anthropics/prompt-eng-interactive-tutorial: Anthropic’s Interactive Prompt Engineering Tutorial ⭐ Stars: 14770 🍴 Forks: 📝 Language: Jupyter Notebook
  • vosen/ZLUDA: CUDA on non-NVIDIA GPUs ⭐ Stars: 1212 ...
Firefly
A firefly flying freely in the AI domain.
Announcement
Welcome to My Personal Blog!
If Not, Please Visit Gitee Mirror.
Recent Post
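Retrieval-Augmented LLM 2024-01-13
LLMs Open Course - 6. Large Models for Text Understanding and Generation 2024-01-10
LLMs Open Course - 5. Efficient Training & Model Compression 2024-01-07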
Categories
  • AI (93)
  • Cython (1)
  • DSA (24)
  • GitHub (62)
  • LLMs (16)
Tags
DSA, RL, Transformer, LLMs, PaperReading, DeepLearning, CV, GPT, PL, domain, github, hf, weibo, ArXivDomain, AI, GitHubTrending, HuggingFacePapers, 微博热搜, leetcode, algo
Archives
  • January 2024 (5)
  • December 2023 (14)
  • November 2023 (26)
  • October 2023 (1)
  • September 2023 (4)
Info
Article: 305
Total Count: 11109.8k
©2023 - 2025 By Firefly