HuggingFace Papers 2025-12-13
数据来源:HuggingFace Papers
Latest Papers1. T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and PlaygroundWe introduce T-pro 2.0, an open-weight Russian LLM for hybrid reasoning and efficient inference. The model supports direct answering and reasoning-trace generation, using a Cyrillic-dense tokenizer and an adapted EAGLE speculative-decoding pipeline to reduce latency. To enable reproducible and extensible research, we release the model weights, the T-Wix 500k instruction corpus, the T-Ma ...
HuggingFace Papers 2025-12-14
数据来源:HuggingFace Papers
Latest Papers1. T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and PlaygroundWe introduce T-pro 2.0, an open-weight Russian LLM for hybrid reasoning and efficient inference. The model supports direct answering and reasoning-trace generation, using a Cyrillic-dense tokenizer and an adapted EAGLE speculative-decoding pipeline to reduce latency. To enable reproducible and extensible research, we release the model weights, the T-Wix 500k instruction corpus, the T-Ma ...
HuggingFace Papers 2025-12-15
数据来源:HuggingFace Papers
Latest Papers1. T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and PlaygroundWe introduce T-pro 2.0, an open-weight Russian LLM for hybrid reasoning and efficient inference. The model supports direct answering and reasoning-trace generation, using a Cyrillic-dense tokenizer and an adapted EAGLE speculative-decoding pipeline to reduce latency. To enable reproducible and extensible research, we release the model weights, the T-Wix 500k instruction corpus, the T-Ma ...
HuggingFace Papers 2025-12-06
数据来源:HuggingFace Papers
Latest Papers1. DAComp: Benchmarking Data Agents across the Full Data Intelligence LifecycleReal-world enterprise data intelligence workflows encompass data engineering that turns raw sources into analytical-ready tables and data analysis that convert those tables into decision-oriented insights. We introduce DAComp, a benchmark of 210 tasks that mirrors these complex workflows. Data engineering (DE) tasks require repository-level engineering on industrial schemas, incl ...
HuggingFace Papers 2025-12-17
数据来源:HuggingFace Papers
Latest Papers1. ReFusion: A Diffusion Large Language Model with Parallel Autoregressive DecodingAutoregressive models (ARMs) are hindered by slow sequential inference. While masked diffusion models (MDMs) offer a parallel alternative, they suffer from critical drawbacks: high computational overhead from precluding Key-Value (KV) caching, and incoherent generation arising from learning dependencies over an intractable space of token combinations. To address these limitat ...
HuggingFace Papers 2025-12-18
数据来源:HuggingFace Papers
Latest Papers1. MMGR: Multi-Modal Generative ReasoningVideo foundation models generate visually realistic and temporally coherent content, but their reliability as world simulators depends on whether they capture physical, logical, and spatial constraints. Existing metrics such as Frechet Video Distance (FVD) emphasize perceptual quality and overlook reasoning failures, including violations of causality, physics, and global consistency. We introduce MMGR (Multi-Modal Ge ...
HuggingFace Papers 2025-12-19
数据来源:HuggingFace Papers
Latest Papers1. Step-GUI Technical ReportRecent advances in multimodal large language models unlock unprecedented opportunities for GUI automation. However, a fundamental challenge remains: how to efficiently acquire high-quality training data while maintaining annotation reliability? We introduce a self-evolving training pipeline powered by the Calibrated Step Reward System, which converts model-generated trajectories into reliable training signals through trajectory-l ...
HuggingFace Papers 2025-12-20
数据来源:HuggingFace Papers
Latest Papers1. Kling-Omni Technical ReportWe present Kling-Omni, a generalist generative framework designed to synthesize high-fidelity videos directly from multimodal visual language inputs. Adopting an end-to-end perspective, Kling-Omni bridges the functional separation among diverse video generation, editing, and intelligent reasoning tasks, integrating them into a holistic system. Unlike disjointed pipeline approaches, Kling-Omni supports a diverse range of user in ...
HuggingFace Papers 2025-12-21
数据来源:HuggingFace Papers
Latest Papers1. Kling-Omni Technical ReportWe present Kling-Omni, a generalist generative framework designed to synthesize high-fidelity videos directly from multimodal visual language inputs. Adopting an end-to-end perspective, Kling-Omni bridges the functional separation among diverse video generation, editing, and intelligent reasoning tasks, integrating them into a holistic system. Unlike disjointed pipeline approaches, Kling-Omni supports a diverse range of user in ...
HuggingFace Papers 2025-12-22
数据来源:HuggingFace Papers
Latest Papers1. Kling-Omni Technical ReportWe present Kling-Omni, a generalist generative framework designed to synthesize high-fidelity videos directly from multimodal visual language inputs. Adopting an end-to-end perspective, Kling-Omni bridges the functional separation among diverse video generation, editing, and intelligent reasoning tasks, integrating them into a holistic system. Unlike disjointed pipeline approaches, Kling-Omni supports a diverse range of user in ...
HuggingFace Papers 2025-12-23
数据来源:HuggingFace Papers
Latest Papers1. Probing Scientific General Intelligence of LLMs with Scientist-Aligned WorkflowsDespite advances in scientific AI, a coherent framework for Scientific General Intelligence (SGI)-the ability to autonomously conceive, investigate, and reason across scientific domains-remains lacking. We present an operational SGI definition grounded in the Practical Inquiry Model (PIM: Deliberation, Conception, Action, Perception) and operationalize it via four scientist-a ...
HuggingFace Papers 2025-12-24
数据来源:HuggingFace Papers
Latest Papers1. DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AIThe rapidly growing demand for high-quality data in Large Language Models (LLMs) has intensified the need for scalable, reliable, and semantically rich data preparation pipelines. However, current practices remain dominated by ad-hoc scripts and loosely specified workflows, which lack principled abstractions, hinder reproducibility, and off ...
HuggingFace Papers 2025-12-25
数据来源:HuggingFace Papers
Latest Papers1. SemanticGen: Video Generation in Semantic SpaceState-of-the-art video generative models typically learn the distribution of video latents in the VAE space and map them to pixels using a VAE decoder. While this approach can generate high-quality videos, it suffers from slow convergence and is computationally expensive when generating long videos. In this paper, we introduce SemanticGen, a novel solution to address these limitations by generating videos in ...
HuggingFace Papers 2025-12-26
数据来源:HuggingFace Papers
Latest Papers1. TurboDiffusion: Accelerating Video Diffusion Models by 100-200 TimesWe introduce TurboDiffusion, a video generation acceleration framework that can speed up end-to-end diffusion generation by 100-200x while maintaining video quality. TurboDiffusion mainly relies on several components for acceleration: (1) Attention acceleration: TurboDiffusion uses low-bit SageAttention and trainable Sparse-Linear Attention (SLA) to speed up attention computation. (2) St ...
HuggingFace Papers 2025-12-16
数据来源:HuggingFace Papers
Latest Papers1. EgoX: Egocentric Video Generation from a Single Exocentric VideoEgocentric perception enables humans to experience and understand the world directly from their own point of view. Translating exocentric (third-person) videos into egocentric (first-person) videos opens up new possibilities for immersive understanding but remains highly challenging due to extreme camera pose variations and minimal view overlap. This task requires faithfully preserving visib ...