Resources

Guides

Online Models

Best AI News Coverage

More comprehensive guides are coming soon. We're creating detailed tutorials and documentation to help you get started.

Papers

Here are some of the most influential papers in AI. We're curating a collection of summaries and key takeaways alongside the original papers. Stay tuned for regular updates.

Attention Is All You Need (2017)

Vaswani et al. • Google Brain & Google Research

Why it matters: Introduced the Transformer architecture, replacing recurrence with self-attention for parallelizable sequence modeling.

Significance: Became the foundation for state-of-the-art models like GPT, BERT, and countless others, revolutionizing natural language processing and beyond.

A Few-Shot Learner (2020)

Brown et al. • OpenAI

Why it matters: Demonstrated that scaling up models and data could lead to emergent capabilities, fundamentally shifting the paradigm from task-specific models to foundation models.

Significance: Cemented the notion of scale and data as key drivers for AI progress.

ImageNet Classification with Deep Convolutional Neural Networks (2012)

Krizhevsky, Sutskever, Hinton • University of Toronto

Why it matters: Showed the power of deep convolutional networks (CNNs) on large datasets, sparking the deep learning revolution.

Significance: Without AlexNet, we might not have had the momentum to pursue Transformers in the first place.

Playing Atari with Deep Reinforcement Learning (2013)

Mnih et al. • DeepMind

Why it matters: Bridged the gap between deep learning and reinforcement learning (RL), showing how neural networks could learn complex tasks directly from pixel input.

Significance: Paved the way for advanced RL models like AlphaGo and MuZero.

Sequence to Sequence Learning with Neural Networks (2014)

Sutskever, Vinyals, Le • Google Brain

Why it matters: Introduced the seq2seq paradigm, enabling models to generate entire sequences of text rather than individual predictions.

Significance: Provided the foundation for machine translation and, eventually, Transformers.

Mastering the Game of Go with Deep Neural Networks and Tree Search (2016)

Silver et al. • DeepMind

Why it matters: AlphaGo showcased the power of combining neural networks with search algorithms to achieve superhuman performance on a complex task.

Significance: Demonstrated AI's potential for strategic reasoning and set the stage for AlphaZero and MuZero.

The Lottery Ticket Hypothesis (2019)

Frankle, Carbin • MIT

Why it matters: Proposed that smaller 'winning subnetworks' exist within large, over-parameterized neural networks, challenging assumptions about scale and training efficiency.

Significance: Inspired a wave of research on model sparsity and efficient AI.

Adam: A Method for Stochastic Optimization (2014)

Kingma, Ba • University of Amsterdam

Why it matters: Introduced the Adam optimizer, which is now the default choice for training neural networks.

Significance: Training Transformers and large neural networks would have been vastly more difficult without it.

Tools

Cursor is the best AI-powered IDE, designed to help developers write, understand, and navigate code more efficiently. It integrates powerful AI capabilities directly into your development workflow -- better than any of the alternatives. (At least as of today, February 26, 2025 -- things change fast around here.)

Ollama

Ollama is a lightweight, open-source tool for running and managing large language models (LLMs) locally on your computer, supporting various model formats and enabling offline inference.

AnythingLLM

Everything great about AI in one desktop application.

Chat with docs, use AI Agents, and more - full locally and offline.

LM Studio

LM Studio is a desktop application for running and managing large language models (LLMs) locally on your computer, supporting various model formats and enabling offline inference.

MCP Servers

Browser Tools MCP

This MCP server lets your client connect directly to your running web browser and capture console logs and network requests. It's a game changer for having an agent (eg in Cursor) debug issues autonomously.

Trello MCP Server

This MCP server that provides tools for interacting with Trello boards including creating tickets, updating tickets, and (as soon as my PR gets accepted), moving tickets. If you don't use Trello, then find the appropriate MCP server for whatever service you do use. It will save you a ton of time and energy.

Postgres MCP Server

This MCP server provides tools for interacting with PostgreSQL databases. It's currently read-only but even that is a game changer allowing agents to be able to query data directly instead of having to be their assistant shuffling around queries and query results.