hankai

hankaixyz

AI & ML interests

None yet

Organizations

None yet

liked a model 4 days ago

Lpzhan/openPangu-embedded-gguf

1B • Updated 4 days ago • 362 • 2

New activity in facebook/MobileLLM-Pro 22 days ago

Seems lagging behind Pangu-1B

👍 👀 5

#4 opened 25 days ago by hankaixyz

upvoted a paper 2 months ago

Benchmarking Optimizers for Large Language Model Pretraining

Paper • 2509.01440 • Published Sep 1 • 24

liked 4 models 3 months ago

liked a model 4 months ago

IntervitensInc/pangu-pro-moe-model

Text Generation • 72B • Updated Jul 9 • 21 • 45

upvoted 2 papers over 1 year ago

Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting

Paper • 2404.18911 • Published Apr 29, 2024 • 30

DenseMamba: State Space Models with Dense Hidden Connection for Efficient Large Language Models

Paper • 2403.00818 • Published Feb 26, 2024 • 19

liked a model over 1 year ago

jamesHD2001/DenseMamba-1.3B

Updated Apr 11, 2024 • 1

authored 9 papers over 1 year ago

Model Rubik's Cube: Twisting Resolution, Depth and Width for TinyNets

Paper • 2010.14819 • Published Oct 28, 2020

GhostNet: More Features from Cheap Operations

Paper • 1911.11907 • Published Nov 27, 2019

Transformer in Transformer

Paper • 2103.00112 • Published Feb 27, 2021 • 1

GhostNetV2: Enhance Cheap Operation with Long-Range Attention

Paper • 2211.12905 • Published Nov 23, 2022

Diffusion-Based 3D Human Pose Estimation with Multi-Hypothesis Aggregation

Paper • 2303.11579 • Published Mar 21, 2023

GPT4Image: Can Large Pre-trained Models Help Vision Models on Perception Tasks?

Paper • 2306.00693 • Published Jun 1, 2023

Masked Image Modeling with Local Multi-Scale Reconstruction

Paper • 2303.05251 • Published Mar 9, 2023

Augmented Shortcuts for Vision Transformers

Paper • 2106.15941 • Published Jun 30, 2021

Boosting Semantic Segmentation from the Perspective of Explicit Class Embeddings

Paper • 2308.12894 • Published Aug 24, 2023
