llama.cppã«MoEã«é©ããCPU/GPUã®æ¯ãåãã®ãªãã·ã§ã³ãå ¥ã£ã¦ãLM Studioã§ããã®ãªãã·ã§ã³ã«å¯¾å¿ãããã¨ã«ãã£ã¦ãMoEã¢ãã«ã§ããGPT-ossãå°ãªãGPUã¡ã¢ãªã§ããããªãã«åãããã«ãªãã¾ãããæ¡å¤§ããã¨ãããã¾ãããLM Studioã®å³ä¸ã®è¡¨ç¤ºã«ããã¨ãã¡ã¤ã³ã¡ã¢ãªã¯12GBããã使ãã¾ãã 14tok/secåºã¦ãã¾ãã CPUã ãã§åããã¨10tok/secã ã£ãã®ã§ã5å²ãã·ã§ããã 0.3.23.0ã«ãForce Model Expert weight onto CPUãã¨ããã¹ã¤ãããå ¥ã£ã¦ããã®ã§ããããOnã«ããã¨Expertã®ã¦ã§ã¤ãããã¹ã¦CPUã«ä¹ãããã«ãªãã¾ããã¢ãã³ã·ã§ã³ã¯GPUã§ã 詳ããã¯ãªãªã¼ã¹ãã¼ãã«ããã¾ãããllama.cppã®--n-cpu-moeã®ä»çµã¿ã使ã£ã¦ãã¨ã®ãã¨ã https://lmstudio.a


{{#tags}}- {{label}}
{{/tags}}