Pradyuman's Papershelf

πŸ“š Curated Collection of Fascinating Research Papers and Articles

View on GitHub

πŸ“š Research Papers & Articles

Welcome to my collection of fascinating research papers and articles! Here, you’ll find summaries, direct links to PDFs, and my personal notes on each paper I’ve explored. Stay curious and keep learning! 🌟


πŸ“Š Quick Stats


πŸ” Browse by Category

Artificial Intelligence / Machine Learning

Status

Core papers exploring breakthroughs in AI/ML, from foundational architectures to cutting-edge applications.

Browse AI/ML Papers β†’

Data Engineering

Status

Papers focusing on distributed systems, databases, and modern data infrastructure.

Browse Data Engineering Papers β†’

Engineering Systems

Status

Landmark papers about large-scale systems and architectural innovations.

Browse Engineering Systems β†’

Web3 & Cryptography

Status

Research on blockchain technology, cryptocurrencies, and distributed systems.

Browse Web3 Papers β†’


πŸ“– Recent Reads

πŸ“ Hive: SQL-like Data Warehousing on Hadoop

Read on November 15, 2024

Hey tech enthusiasts! Ever wondered how big companies crunch through terabytes of data using just SQL? Today we’re diving deep into Hive - the tech that makes that possible. It’s SQL, but with superpowers for handling massive datasets across hundreds of machines. Let’s break it down!

πŸ“ Inside MapReduce: The Engine That Powers Large-Scale Data Processing

Read on December 13, 2024

Ever wondered how Google processes petabytes of data efficiently? MapReduce is the elegant solution that revolutionized big data processing by turning complex distributed computations into simple Map and Reduce operations. This article breaks down how it works, and why it’s still relevant today.


πŸ“š Detailed Collections

πŸ”¬ Artificial Intelligence & Machine Learning

Status Paper Notes Date Added
βœ… Attention Is All You Need ✍️ 2024-10-28
πŸ“– Misspecification in Inverse Reinforcement Learning Β  2024-11-28
πŸ“‹ Diffusion Models Are Real-Time Game Engines Β  2024-09-14
πŸ“‹ Textbooks Are All You Need Β  2024-11-27
πŸ“‹ Large Concept Models Β  2024-12-22

πŸ›  Data Engineering

Status Paper Notes Date Added
βœ… A Comprehensive Survey on Vector Database: Storage and Retrieval Technique, Challenge ✍️ 2024-07-04
βœ… Hive: A Warehousing Solution Over a Map-Reduce Framework ✍️ 2024-09-22
πŸ“‹ BigLake: BigQuery’s Evolution toward a Multi-Cloud Lakehouse Β  2024-09-22
πŸ“‹ Presto: A Decade of SQL Analytics at Meta Β  2024-07-04
πŸ“‹ Computation Reuse via Fusion in Amazon Athena Β  2024-11-18

πŸš€ Engineering Systems

Status Paper Notes Date Added
βœ… MapReduce: Simplified Data Processing on Large Clusters ✍️ 2024-09-30
πŸ“‹ The Google File System Β  2024-09-30
πŸ“‹ Zanzibar: Google’s Consistent, Global Authorization System Β  2024-09-30

🌐 Blockchain / Cryptography

Status Paper Notes Date Added
πŸ“– Bitcoin Β  2024-07-04
πŸ“‹ Ethereum Β  2024-09-22

πŸ“¦ Miscellaneous

Status Paper Notes Date Added
βœ… Advantages and Disadvantages of a Monolithic Repository ✍️ 2024-09-22
πŸ“‹ Bloom Filters: Design Innovations and Novel Applications Β  2024-07-04

Legend:


πŸ“₯ Resources

Last updated: December 9, 2024