Artemiy Zhukov

dog-god

AI & ML interests

None yet

Recent Activity

liked a model 4 days ago

ai-sage/GigaChat3-702B-A36B-preview

liked a model 16 days ago

Downtown-Case/GLM-4.6-128GB-RAM-IK-GGUF

liked a model 17 days ago

salakash/SamKash-Tolstoy

Organizations

None yet

upvoted 2 collections 5 months ago

ERNIE 4.5

collection of ERNIE 4.5 models. • 27 items • Updated 14 days ago • 178

RpR Models

RpR (RolePlay with Reasoning) models which are built on RPMax datasets with properly trained multi-turn reasoning. • 8 items • Updated Jun 25 • 13

upvoted a paper 5 months ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 271

upvoted a paper 7 months ago

An Empirical Study of Qwen3 Quantization

Paper • 2505.02214 • Published May 4 • 25

upvoted 2 collections 7 months ago

Qwen3

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 79 items • Updated 25 days ago • 234

GLM-4-0414

GLM-4-0414 series model • 8 items • Updated Jun 30 • 133

upvoted 2 papers 8 months ago

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2 • 86

Sample, Don't Search: Rethinking Test-Time Alignment for Language Models

Paper • 2504.03790 • Published Apr 4 • 3

upvoted 2 papers about 1 year ago

Language Models Learn to Mislead Humans via RLHF

Paper • 2409.12822 • Published Sep 19, 2024 • 11

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Paper • 2408.16532 • Published Aug 29, 2024 • 50

upvoted 9 papers over 1 year ago

Scalable Autoregressive Image Generation with Mamba

Paper • 2408.12245 • Published Aug 22, 2024 • 26

Elucidating the Design Space of Diffusion-Based Generative Models

Paper • 2206.00364 • Published Jun 1, 2022 • 18

Scaling Laws for Linear Complexity Language Models

Paper • 2406.16690 • Published Jun 24, 2024 • 23

Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models

Paper • 2406.09416 • Published Jun 13, 2024 • 29

Depth Anything V2

Paper • 2406.09414 • Published Jun 13, 2024 • 103

FIFO-Diffusion: Generating Infinite Videos from Text without Training

Paper • 2405.11473 • Published May 19, 2024 • 57

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 259

Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition

Paper • 2403.14148 • Published Mar 21, 2024 • 21

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Paper • 2403.03206 • Published Mar 5, 2024 • 70

upvoted a paper almost 2 years ago

λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space

Paper • 2402.05195 • Published Feb 7, 2024 • 19