klotz: machine learning* + deep learning* + ai*

AlexNet, the AI model that started it all, released in source code form for all to download

AlexNet, a groundbreaking neural network developed in 2012 by Alex Krizhevsky, Ilya Sutskever, and Geoffrey Hinton, has been released in source code form by the Computer History Museum in collaboration with Google. This model significantly advanced the field of AI by demonstrating a massive leap in image recognition capabilities.

2025-03-21 Tags: alexnet, ai, neural network, computer history museum, google, image recognition, deep learning, geoffrey hinton, alex krizhevsky, ilya sutskever by klotz

ByteDance Research Releases DAPO: A Fully Open-Sourced LLM Reinforcement Learning System at Scale

ByteDance Research has released DAPO (Decoupled Clip and Dynamic sAmpling Policy Optimization), an open-source reinforcement learning system for LLMs that aims to improve reasoning ability and address reproducibility issues. DAPO introduces four techniques (Clip-Higher, Dynamic Sampling, Token-level Policy Gradient Loss, and Overlong Reward Shaping) and achieves a score of 50 on the AIME 2024 benchmark with the Qwen2.5-32B model.

2025-03-21 Tags: llm, reinforcement learning, dapo, open source, bytedance, ai, machine learning, reasoning, aime, qwen2.5 by klotz
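
Three of the DAPO components named above can be illustrated with a small pure-Python sketch. This is not ByteDance's released code: the function names, the list-based data layout, and the default clip bounds are assumptions chosen for illustration, and Overlong Reward Shaping is left out.

```python
import math

def group_advantages(rewards):
    """Normalize rewards within one prompt's group of sampled responses.

    Assumption: population standard deviation plus a small epsilon.
    """
    mean = sum(rewards) / len(rewards)
    var = sum((r - mean) ** 2 for r in rewards) / len(rewards)
    return [(r - mean) / (math.sqrt(var) + 1e-8) for r in rewards]

def keep_group(rewards):
    """Dynamic Sampling: drop groups whose rewards are all identical,
    since every advantage in such a group is zero."""
    return max(rewards) != min(rewards)

def token_level_loss(ratios, advantages, eps_low=0.2, eps_high=0.28):
    """Clipped surrogate loss averaged over all tokens in the batch.

    ratios:     one list of per-token probability ratios per response
    advantages: one advantage per response
    eps_low/eps_high: illustrative defaults; Clip-Higher means the upper
    bound is looser than the lower bound.
    """
    total, n_tokens = 0.0, 0
    for resp_ratios, adv in zip(ratios, advantages):
        for r in resp_ratios:
            clipped = min(max(r, 1.0 - eps_low), 1.0 + eps_high)
            total += min(r * adv, clipped * adv)
            n_tokens += 1
    # Token-level: divide by the token count, not the response count,
    # so long responses are not down-weighted.
    return -total / n_tokens
```

In this sketch a training step would call `keep_group` on each prompt's rewards first, then compute `group_advantages` for the surviving groups and pass them to `token_level_loss`.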

AAAI: How AI can achieve human-level intelligence: researchers call for change in tack

An AAAI survey finds that most respondents are sceptical that the technology underpinning large language models is sufficient for artificial general intelligence.

>"More than three-quarters of respondents said that enlarging current AI systems ― an approach that has been hugely successful in enhancing their performance over the past few years ― is unlikely to lead to what is known as artificial general intelligence (AGI). An even higher proportion said that neural networks, the fundamental technology behind generative AI, alone probably cannot match or surpass human intelligence. And the very pursuit of these capabilities also provokes scepticism: less than one-quarter of respondents said that achieving AGI should be the core mission of the AI research community."

2025-03-05 Tags: ai, agi, aaai, neural networks, symbolic ai, nature by klotz

The Illustrated Transformer

A detailed explanation of the Transformer model, a key architecture in modern deep learning for tasks like neural machine translation, focusing on components like self-attention, encoder and decoder stacks, positional encoding, and training.

2024-12-28 Tags: transformer, machine translation, self-attention, encoder-decoder, positional encoding, neural network, ai, deep learning by klotz
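
Two of the components listed in that summary, positional encoding and self-attention, fit in a short standard-library sketch. It is a simplification rather than the article's code: the function names are hypothetical, and the learned query, key, and value projections are omitted, so the caller passes those matrices directly.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def positional_encoding(seq_len, d_model):
    """Sinusoidal encoding: sine on even dimensions, cosine on odd ones."""
    pe = []
    for pos in range(seq_len):
        row = []
        for i in range(d_model):
            angle = pos / (10000 ** ((2 * (i // 2)) / d_model))
            row.append(math.sin(angle) if i % 2 == 0 else math.cos(angle))
        pe.append(row)
    return pe

def self_attention(q, k, v):
    """Scaled dot-product attention; q, k, v are lists of row vectors."""
    d_k = len(k[0])
    out = []
    for qi in q:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [sum(a * b for a, b in zip(qi, kj)) / math.sqrt(d_k) for kj in k]
        weights = softmax(scores)
        # Each output row is a weighted average of the value rows.
        out.append([sum(w * vj[c] for w, vj in zip(weights, v))
                    for c in range(len(v[0]))])
    return out
```

A typical use would add `positional_encoding(len(x), len(x[0]))` element-wise to the token embeddings `x` and then call `self_attention(x, x, x)`. A full Transformer layer would additionally apply learned projections, multiple heads, residual connections, and a feed-forward block.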

Surpassing Trillion Parameters and GPT-3 with Switch Transformers – a path to AGI? - KDnuggets

Combined with the growing trend of multimodality, or models that combine language, image, and other types of capabilities, we may see a trend of AI models operating more like a committee of different components rather than a monolithic block. This approach actually has many conceptual similarities to a set of interesting ideas described by Marvin Minsky and Seymour Papert from the early days of AI.

2021-10-03 Tags: deep learning, gpt-3, transformer, switched, attention, nlp, ai, marvin minsky, society of mind by klotz

samrawal/emacs-secondmate: An open-source, mini imitation of GitHub Copilot for Emacs.