Integrate all MLX-LM model architectures with proper sharding augmentations for distributed inference in **_dnet_**.

Priority is based on production deployments, HuggingFace downloads, and benchmark performance.

- [x] gpt_oss
- [x] deepseek_v2
- [ ] deepseek_v3
- [x] llama
- [ ] llama4
- [x] qwen3
- [ ] qwen3_moe
- [ ] qwen3_next
- [ ] qwen2
- [ ] qwen2_moe
- [ ] internlm3
- [ ] gemma3
- [ ] gemma3_text
- [ ] gemma3n
- [ ] glm4
- [ ] glm4_moe
- [ ] olmo2
- [ ] olmo3