一区二区日本_久久久久久久国产精品_无码国模国产在线观看_久久99深爱久久99精品_亚洲一区二区三区四区五区午夜_日本在线观看一区二区

UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer

ali-vilab/unianimate-dit ? 15 Apr 2025

Furthermore, we adopt a simple concatenation operation to integrate the reference appearance into the model and incorporate the pose information of the reference image for enhanced pose alignment.

Image Animation

157
1.50 stars / hour

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

lizonghang/prima.cpp ? 7 Apr 2025

Emergency of DeepSeek R1 and QwQ 32B have broken through performance barriers for running frontier large language models (LLMs) on home devices.

Quantization

403
1.46 stars / hour

REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers

End2End-Diffusion/REPA-E ? ? 14 Apr 2025

We show that while diffusion loss is ineffective, end-to-end training can be unlocked through the representation-alignment (REPA) loss -- allowing both VAE and diffusion model to be jointly tuned during the training process.

103
1.00 stars / hour

Liquid: Language Models are Scalable Multi-modal Generators

foundationvision/liquid ? ? 5 Dec 2024

We present Liquid, an auto-regressive generation paradigm that seamlessly integrates visual comprehension and generation by tokenizing images into discrete codes and learning these code embeddings alongside text tokens within a shared feature space for both vision and language.

Language Modeling Language Modelling +2

515
0.99 stars / hour

Advanced Video Inpainting Using Optical Flow-Guided Efficient Diffusion

nevsnev/fgdvi ? ? 1 Dec 2024

Specifically, FloED employs a dual-branch architecture, where a flow branch first restores corrupted flow and a multi-scale flow adapter provides motion guidance to the main inpainting branch.

Denoising Optical Flow Estimation +1

189
0.98 stars / hour

Bitnet.cpp: Efficient Edge Inference for Ternary LLMs

microsoft/bitnet ? ? 17 Feb 2025

The advent of 1-bit large language models (LLMs), led by BitNet b1. 58, has spurred interest in ternary LLMs.

13,883
0.77 stars / hour

The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search

sakanaai/ai-scientist-v2 ? ? 10 Apr 2025

AI is increasingly playing a pivotal role in transforming how scientific discoveries are made.

scientific discovery

640
0.74 stars / hour

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

bytedance/ui-tars ? ? 21 Jan 2025

This paper introduces UI-TARS, a native GUI agent model that solely perceives the screenshots as input and performs human-like interactions (e. g., keyboard and mouse operations).

4,129
0.71 stars / hour

BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence

HorizonRobotics/BIP3D ? ? 22 Nov 2024

In embodied intelligence systems, a key component is 3D perception algorithm, which enables agents to understand their surrounding environments.

3D visual grounding

111
0.68 stars / hour
主站蜘蛛池模板: 99久久亚洲 | 亚洲欧洲成人在线 | 一区二区三区四区日韩 | 欧美成人精品一区二区男人看 | 亚洲精品久久嫩草网站秘色 | 在线观看亚洲精品 | 欧美激情精品久久久久久 | 91观看| www亚洲精品 | 国产成人精品午夜 | 欧美五月婷婷 | 中文字幕久久久 | 欧美精品久久久久久久久久 | 国产欧美一区二区三区另类精品 | 日韩精品一区二区三区中文在线 | 日批免费观看 | 欧美在线观看免费观看视频 | 久草久草久草 | 羞羞视频在线网站观看 | 超碰97人人人人人蜜桃 | 久久视频精品 | 米奇7777狠狠狠狠视频 | 婷婷久久综合 | 91大神在线看 | 亚洲综合大片69999 | 国产高清精品在线 | 免费一区二区三区在线视频 | 97色伦网 | 午夜精品久久久久久久久久久久久 | 国产日韩精品视频 | 午夜视频免费在线观看 | 不卡av在线| 精品国产乱码久久久久久影片 | 国产精品1区 | 日韩视频中文字幕 | 精品国产乱码久久久久久a丨 | 日韩成人在线网址 | 欧美精品99 | 亚洲性人人天天夜夜摸 | 国产精彩视频 | 欧美成人精品一区 |