anemll
/

anemll-Qwen-Qwen3-0.6B-FP16-ctx512_0.3.3

Apple Neural Engine

Model card Files Files and versions

anemll-Qwen-Qwen3-0.6B-FP16-ctx512_0.3.3 / meta.yaml

anemll's picture

Upload folder using huggingface_hub

1954902 verified 5 months ago

history blame contribute delete

590 Bytes

	model_info:
	name: anemll-Qwen-Qwen3-0.6B-ctx512
	version: 0.3.3
	description: \|
	Demonstarates running Qwen-Qwen3-0.6B on Apple Neural Engine
	Context length: 512
	Batch size: 64
	Chunks: 1
	license: MIT
	author: Anemll
	framework: Core ML
	language: Python
	architecture: qwen3
	parameters:
	context_length: 512
	batch_size: 64
	lut_embeddings: none
	lut_ffn: none
	lut_lmhead: none
	num_chunks: 1
	model_prefix: qwen
	embeddings: qwen_embeddings.mlmodelc
	lm_head: qwen_lm_head.mlmodelc
	ffn: qwen_FFN_PF.mlmodelc
	split_lm_head: 16