OpenMythos is an open-source PyTorch project by Kye Gomez that proposes a theoretical reconstruction of Anthropic's Claude Mythos architecture. Instead of a standard stack of distinct transformer layers, it suggests a Recurrent-Depth Transformer (RDT) design in which a weight-tied block is looped for multiple iterations, so reasoning depth grows with inference-time compute rather than parameter count (a minimal sketch follows the list below). By combining Mixture-of-Experts with Multi-Latent Attention and stability constraints, the 770M-parameter model achieves performance parity with a 1.3B-parameter standard transformer.
* open-source PyTorch reconstruction of claude mythos
* proposes recurrent-depth transformer architecture
* reasoning depth scales via inference-time loops rather than parameter count
* uses mixture-of-experts for domain breadth
* implements multi-latent attention to reduce memory usage
* employs lti injection and adaptive computation time for stability
* achieves 1.3b parameter performance with only 770m parameters
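The looping idea is the part most easily shown in code. The following is a minimal PyTorch sketch, not OpenMythos's actual implementation: the class names, dimensions, and plain pre-norm block are illustrative assumptions. A single weight-tied block is applied for a caller-chosen number of iterations, so effective depth scales with inference-time compute while the parameter count stays fixed.

```python
import torch
import torch.nn as nn

class RecurrentDepthBlock(nn.Module):
    """One pre-norm transformer step; reused every loop iteration."""

    def __init__(self, d_model: int, n_heads: int):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.ffn = nn.Sequential(
            nn.Linear(d_model, 4 * d_model),
            nn.GELU(),
            nn.Linear(4 * d_model, d_model),
        )
        self.norm1 = nn.LayerNorm(d_model)
        self.norm2 = nn.LayerNorm(d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Pre-norm self-attention with a residual connection.
        h = self.norm1(x)
        a, _ = self.attn(h, h, h, need_weights=False)
        x = x + a
        # Pre-norm feed-forward with a residual connection.
        return x + self.ffn(self.norm2(x))

class RecurrentDepthTransformer(nn.Module):
    """Hypothetical recurrent-depth transformer: one shared block,
    looped `depth` times instead of stacking `depth` distinct layers."""

    def __init__(self, d_model: int = 512, n_heads: int = 8):
        super().__init__()
        self.block = RecurrentDepthBlock(d_model, n_heads)  # weights shared across iterations

    def forward(self, x: torch.Tensor, depth: int = 4) -> torch.Tensor:
        # `depth` is chosen at inference time: more loops, more
        # effective depth, identical parameter count.
        for _ in range(depth):
            x = self.block(x)
        return x

x = torch.randn(2, 16, 512)           # (batch, seq, d_model)
model = RecurrentDepthTransformer()
shallow = model(x, depth=2)
deep = model(x, depth=8)              # same weights, more compute
```

Because the loop count is a runtime argument rather than a structural property, the same checkpoint can trade latency for reasoning depth on a per-request basis.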
OpenAI's release of GPT-OSS marks its first major open-source LLM since GPT-2, featuring improvements in reasoning, tool use, and problem-solving. The article explores its architecture, message formatting, reasoning modes, and tokenizer details.
This article discusses Time-MOE, an open-source time-series foundation model using Mixture-of-Experts (MOE) to improve forecasting accuracy while reducing computational costs. Key contributions include the Time-300B dataset, scaling laws for time series, and the Time-MOE architecture.
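To illustrate why sparse MoE can improve capacity while reducing compute, here is a generic top-k routing layer in PyTorch. This is a hedged sketch under assumed names and sizes, not Time-MOE's actual code: each token is dispatched to only k of n experts, so parameters scale with n while per-token FLOPs scale with k.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SparseMoE(nn.Module):
    """Illustrative top-k Mixture-of-Experts feed-forward layer."""

    def __init__(self, d_model: int, n_experts: int = 8, k: int = 2):
        super().__init__()
        self.k = k
        self.gate = nn.Linear(d_model, n_experts)  # router
        self.experts = nn.ModuleList(
            nn.Sequential(
                nn.Linear(d_model, 4 * d_model),
                nn.GELU(),
                nn.Linear(4 * d_model, d_model),
            )
            for _ in range(n_experts)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq, d_model) -> flatten tokens for routing.
        tokens = x.reshape(-1, x.shape[-1])
        scores = self.gate(tokens)                  # (n_tokens, n_experts)
        weights, idx = scores.topk(self.k, dim=-1)  # top-k experts per token
        weights = F.softmax(weights, dim=-1)        # renormalize gate weights
        out = torch.zeros_like(tokens)
        for e, expert in enumerate(self.experts):
            # Find the tokens whose top-k routing selected expert e.
            token_ids, slot = (idx == e).nonzero(as_tuple=True)
            if token_ids.numel() == 0:
                continue  # expert e receives no tokens this batch
            out[token_ids] += (
                weights[token_ids, slot].unsqueeze(-1)
                * expert(tokens[token_ids])
            )
        return out.reshape_as(x)

layer = SparseMoE(d_model=64)
series = torch.randn(4, 128, 64)   # e.g. embedded time-series patches
print(layer(series).shape)         # torch.Size([4, 128, 64])
```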