Tags: computer science* + machine learning*

  1. Personal website of Alex L. Zhang, a PhD student at MIT CSAIL focusing on the efficiency and utilization of language models. His research spans ML systems, language model benchmarks, and specialized model development.
    Key areas of work include:
    - Recursive Language Models (RLMs) and Project Popcorn
    - GPU programming competitions via KernelBot and GPU MODE
    - Benchmarking capabilities through VideoGameBench and KernelBench
    - Development of models like Neo-1 and KernelLLM-8B
  2. This is an open, unconventional textbook covering mathematics, computing, and artificial intelligence from foundational principles. It's designed for practitioners seeking a deep understanding, moving beyond exam preparation and focusing on real-world application. The author, drawing from years of experience in AI/ML, has compiled notes that prioritize intuition, context, and clear explanations, avoiding dense notation and outdated material.
    The compendium covers a broad range of topics, from vectors and matrices to machine learning, computer vision, and multimodal learning, with future chapters planned for areas like data structures and AI inference.
  3. In cellular automata, simple rules create elaborate structures. Now researchers can start with the structures and reverse-engineer the rules.
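    The forward direction is easy to demonstrate: a one-dimensional automaton whose next state depends only on a three-cell neighborhood already produces intricate global patterns. Below is a minimal Python sketch using Wolfram's Rule 110; the rule number, grid width, and number of steps are arbitrary illustrative choices, not taken from the article.

      # Minimal elementary cellular automaton (Wolfram Rule 110): a simple
      # local rule producing elaborate global structure. Rule number and
      # grid size are arbitrary choices for illustration only.
      def step(cells, rule=110):
          # Each cell's next state depends only on itself and its two neighbors.
          table = [(rule >> i) & 1 for i in range(8)]
          n = len(cells)
          return [table[(cells[(i - 1) % n] << 2) | (cells[i] << 1) | cells[(i + 1) % n]]
                  for i in range(n)]

      cells = [0] * 40 + [1] + [0] * 40          # single live cell in the middle
      for _ in range(40):
          print("".join("#" if c else "." for c in cells))
          cells = step(cells)

    Reverse-engineering runs the other way: given observed configurations, infer which of the 256 possible elementary rule tables could have generated them.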
  4. A new study by MIT CSAIL researchers maps the challenges of AI in software development, identifying bottlenecks and highlighting research directions to move the field forward, aiming to allow humans to focus on high-level design while automating routine tasks.
  5. "We present a systematic review of some of the popular machine learning based email spam filtering approaches."

    "Our review covers survey of the important concepts, attempts, efficiency, and the research trend in spam filtering."
  6. This paper introduces Cross-Layer Attention (CLA), an extension of Multi-Query Attention (MQA) and Grouped-Query Attention (GQA) for reducing the size of the key-value cache in transformer-based autoregressive large language models (LLMs). The authors demonstrate that CLA can reduce the cache size by another 2x while maintaining nearly the same accuracy as unmodified MQA, enabling inference with longer sequence lengths and larger batch sizes.
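    The core idea is that adjacent layers share a single key/value cache rather than each layer storing its own, so the per-token cache footprint shrinks by the sharing factor. Below is a minimal NumPy sketch of that sharing pattern with a factor of 2; the shapes, the dictionary-based cache, and the use of raw hidden states in place of learned key/value projections are simplifying assumptions for illustration, not the paper's implementation.

      # Sketch of the idea behind Cross-Layer Attention (CLA): consecutive
      # layers share one key/value cache instead of each keeping its own,
      # roughly halving KV-cache memory. Shapes, the sharing factor of 2,
      # and the dict-based "cache" are illustrative assumptions only.
      import numpy as np

      n_layers, d = 8, 64
      kv_cache = {}                              # maps a layer group -> (K, V)

      def attend(q, k, v):
          # Plain scaled dot-product attention over the cached keys/values.
          scores = q @ k.T / np.sqrt(d)
          weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
          weights /= weights.sum(axis=-1, keepdims=True)
          return weights @ v

      def decode_step(x_per_layer):
          # x_per_layer[i]: this step's hidden state entering layer i (toy stand-in).
          for layer in range(n_layers):
              group = layer // 2                 # layers 0-1 share, 2-3 share, ...
              if layer % 2 == 0:
                  # Only the first layer of each pair projects and caches new K/V.
                  k_new = x_per_layer[layer].reshape(1, d)   # stand-in for W_k @ x
                  v_new = x_per_layer[layer].reshape(1, d)   # stand-in for W_v @ x
                  K, V = kv_cache.get(group, (np.empty((0, d)), np.empty((0, d))))
                  kv_cache[group] = (np.vstack([K, k_new]), np.vstack([V, v_new]))
              K, V = kv_cache[group]             # the odd layer reuses its pair's cache
              _ = attend(x_per_layer[layer].reshape(1, d), K, V)

      decode_step([np.random.randn(d) for _ in range(n_layers)])
      print({g: K.shape for g, (K, _) in kv_cache.items()})   # 4 caches for 8 layers

    In the sketch only the even-numbered layer of each pair writes new keys and values; its partner attends over the same cache, which is why eight layers end up with only four caches.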
