Releases model release open-source global

Tech Brief (Oct. 28): Meituan Releases Open-Source Video Generation Model - caixinglobal.com

Google News - Meituan AIOctober 28, 20251 min read0 views

<a href="https://news.google.com/rss/articles/CBMiwgFBVV95cUxQbkppMHVIcHFGRy1mVThXY2h4eFZJbnFpemlNT19vN3Z3QXF5czNGY3NyYkowUTVTd0puTm83N3ZTSHJocE9QZzRzb0VLcVA2a294RzlyaU11dzFub3Y1Tjd1NVB0b282aUp4VGwtOXI0R1UyUVFYQUVTM2RYQk9iYVZpSWdiNmtlN2hnT05qSzdWLVViVFQwSmtlczZnS1gtMk1kdGpRMTVVSS1uYU56N2FXaTZxRmRodUI4anFfYnl4QQ?oc=5" target="_blank">Tech Brief (Oct. 28): Meituan Releases Open-Source Video Generation Model</a> <font color="#6f6f6f">caixinglobal.com</font>

Could not retrieve the full article text.

Read on Google News - Meituan AI →

Original source

Google News - Meituan AI

https://news.google.com/rss/articles/CBMiwgFBVV95cUxQbkppMHVIcHFGRy1mVThXY2h4eFZJbnFpemlNT19vN3Z3QXF5czNGY3NyYkowUTVTd0puTm83N3ZTSHJocE9QZzRzb0VLcVA2a294RzlyaU11dzFub3Y1Tjd1NVB0b282aUp4VGwtOXI0R1UyUVFYQUVTM2RYQk9iYVZpSWdiNmtlN2hnT05qSzdWLVViVFQwSmtlczZnS1gtMk1kdGpRMTVVSS1uYU56N2FXaTZxRmRodUI4anFfYnl4QQ?oc=5

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

modelreleaseopen-source

ProductsFresh

LGFNet: Local-Global Fusion Network with Fidelity Gap Delta Learning for Multi-Source Aerodynamics

arXiv:2603.29303v1 Announce Type: new Abstract: The precise fusion of computational fluid dynamic (CFD) data, wind tunnel tests data, and flight tests data in aerodynamic area is essential for obtaining comprehensive knowledge of both localized flow structures and global aerodynamic trends across the entire flight envelope. However, existing methodologies often struggle to balance high-resolution local fidelity with wide-range global dependency, leading to either a loss of sharp discontinuities or an inability to capture long-range topological correlations. We propose Local-Global Fusion Network (LGFNet) for multi-scale feature decomposition to extract this dual-natured aerodynamic knowledge. To this end, LGFNet combines a spatial perception layer that integrates a sliding window mechanism

arXiv cs.LG

1mabout 3 hours ago

ModelsFresh

From Physics to Surrogate Intelligence: A Unified Electro-Thermo-Optimization Framework for TSV Networks

arXiv:2603.29268v1 Announce Type: new Abstract: High-density through-substrate vias (TSVs) enable 2.5D/3D heterogeneous integration but introduce significant signal-integrity and thermal-reliability challenges due to electrical coupling, insertion loss, and self-heating. Conventional full-wave finite-element method (FEM) simulations provide high accuracy but become computationally prohibitive for large design-space exploration. This work presents a scalable electro-thermal modeling and optimization framework that combines physics-informed analytical modeling, graph neural network (GNN) surrogates, and full-wave sign-off validation. A multi-conductor analytical model computes broadband S-parameters and effective anisotropic thermal conductivities of TSV arrays, achieving $5\%-10\%$ relative

arXiv cs.LG

1mabout 3 hours ago

ProductsFresh

M2H-MX: Multi-Task Dense Visual Perception for Real-Time Monocular Spatial Understanding

arXiv:2603.29236v1 Announce Type: new Abstract: Monocular cameras are attractive for robotic perception due to their low cost and ease of deployment, yet achieving reliable real-time spatial understanding from a single image stream remains challenging. While recent multi-task dense prediction models have improved per-pixel depth and semantic estimation, translating these advances into stable monocular mapping systems is still non-trivial. This paper presents M2H-MX, a real-time multi-task perception model for monocular spatial understanding. The model preserves multi-scale feature representations while introducing register-gated global context and controlled cross-task interaction in a lightweight decoder, enabling depth and semantic predictions to reinforce each other under strict latency

arXiv cs.CV

1mabout 3 hours ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 240 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

More in Releases

ReleasesFresh

Hierarchical Visual Relocalization with Nearest View Synthesis from Feature Gaussian Splatting

arXiv:2603.29185v1 Announce Type: new Abstract: Visual relocalization is a fundamental task in the field of 3D computer vision, estimating a camera's pose when it revisits a previously known scene. While point-based hierarchical relocalization methods have shown strong scalability and efficiency, they are often limited by sparse image observations and weak feature matching. In this work, we propose SplatHLoc, a novel hierarchical visual relocalization framework that uses Feature Gaussian Splatting as the scene representation. To address the sparsity of database images, we propose an adaptive viewpoint retrieval method that synthesizes virtual candidates with viewpoints more closely aligned with the query, thereby improving the accuracy of initial pose estimation. For feature matching, we o

arXiv cs.CV

1mabout 3 hours ago

ReleasesFresh

Open Machine Translation for Esperanto

arXiv:2603.29345v1 Announce Type: new Abstract: Esperanto is a widespread constructed language, known for its regular grammar and productive word formation. Besides having substantial resources available thanks to its online community, it remains relatively underexplored in the context of modern machine translation (MT) approaches. In this work, we present the first comprehensive evaluation of open-source MT systems for Esperanto, comparing rule-based systems, encoder-decoder models, and LLMs across model sizes. We evaluate translation quality across six language directions involving English, Spanish, Catalan, and Esperanto using multiple automatic metrics as well as human evaluation. Our results show that the NLLB family achieves the best performance in all language pairs, followed closel

arXiv cs.CL

1mabout 3 hours ago

ReleasesFresh

Dual-Imbalance Continual Learning for Real-World Food Recognition

arXiv:2603.29133v1 Announce Type: new Abstract: Visual food recognition in real-world dietary logging scenarios naturally exhibits severe data imbalance, where a small number of food categories appear frequently while many others occur rarely, resulting in long-tailed class distributions. In practice, food recognition systems often operate in a continual learning setting, where new categories are introduced sequentially over time. However, existing studies typically assume that each incremental step introduces a similar number of new food classes, which rarely happens in real world where the number of newly observed categories can vary significantly across steps, leading to highly uneven learning dynamics. As a result, continual food recognition exhibits a dual imbalance: imbalanced sample

arXiv cs.CV

2mabout 3 hours ago

ReleasesFresh

Efficient Bilevel Optimization with KFAC-Based Hypergradients

arXiv:2603.29108v1 Announce Type: new Abstract: Bilevel optimization (BO) is widely applicable to many machine learning problems. Scaling BO, however, requires repeatedly computing hypergradients, which involves solving inverse Hessian-vector products (IHVPs). In practice, these operations are often approximated using crude surrogates such as one-step gradient unrolling or identity/short Neumann expansions, which discard curvature information. We build on implicit function theorem-based algorithms and propose to incorporate Kronecker-factored approximate curvature (KFAC), yielding curvature-aware hypergradients with a better performance efficiency trade-off than Conjugate Gradient (CG) or Neumann methods and consistently outperforming unrolling. We evaluate this approach across diverse tas

arXiv cs.LG

1mabout 3 hours ago