Pradyuman's Papershelf

📚 Curated Collection of Fascinating Research Papers and Articles

📚 Research Papers & Articles

Welcome to my collection of fascinating research papers and articles! Here, you’ll find summaries, direct links to PDFs, and my personal notes on each paper I’ve explored. Stay curious and keep learning! 🌟



🔍 Browse by Category

Artificial Intelligence / Machine Learning

Core papers exploring breakthroughs in AI/ML, from foundational architectures to cutting-edge applications.

Browse AI/ML Papers →

Data Engineering

Papers focusing on distributed systems, databases, and modern data infrastructure.

Browse Data Engineering Papers →

Engineering Systems

Landmark papers about large-scale systems and architectural innovations.

Browse Engineering Systems →

Web3 & Cryptography

Research on blockchain technology, cryptocurrencies, and distributed systems.

Browse Web3 Papers →

Cyber Security

Research papers focusing on network security, information assurance, threat analysis, and more.

Browse Cyber Security Papers →


📖 Recent Reads

📝 Hive: SQL-like Data Warehousing on Hadoop

Read on November 15, 2024

Hey tech enthusiasts! Ever wondered how big companies crunch through terabytes of data using just SQL? Today we’re diving deep into Hive - the tech that makes that possible. It’s SQL, but with superpowers for handling massive datasets across hundreds of machines. Let’s break it down!

📝 Inside MapReduce: The Engine That Powers Large-Scale Data Processing

Read on December 13, 2024

Ever wondered how Google processes petabytes of data efficiently? MapReduce is the elegant solution that revolutionized big data processing by turning complex distributed computations into simple Map and Reduce operations. This article breaks down how it works and why it’s still relevant today.


🗂️ Report Reading Tracker

| Status | Report | Notes | Date Read |
| --- | --- | --- | --- |
| ✅ | Gartner Marketing Predictions | ✍️ | 2025-02-05 |
| ✅ | Scalable Business Operations for Tech CEOs Primer for 2024 | ✍️ | 2025-02-13 |
| 📋 | 2024 Data Breach Investigations Report | | |
| 📋 | 2025 Strategic Roadmap for Cybersecurity Leadership | | |

📚 Detailed Collections

🔬 Artificial Intelligence & Machine Learning

| Status | Paper | Notes | Date Added |
| --- | --- | --- | --- |
| ✅ | Attention Is All You Need | ✍️ | 2024-10-28 |
| 📖 | Misspecification in Inverse Reinforcement Learning | | 2024-11-28 |
| 📋 | Diffusion Models Are Real-Time Game Engines | | 2024-09-14 |
| 📋 | Textbooks Are All You Need | | 2024-11-27 |
| 📋 | Large Concept Models | | 2024-12-22 |

🛠 Data Engineering

| Status | Paper | Notes | Date Added |
| --- | --- | --- | --- |
| ✅ | A Comprehensive Survey on Vector Database: Storage and Retrieval Technique, Challenge | ✍️ | 2024-07-04 |
| ✅ | Hive: A Warehousing Solution Over a Map-Reduce Framework | ✍️ | 2024-09-22 |
| 📋 | BigLake: BigQuery’s Evolution toward a Multi-Cloud Lakehouse | | 2024-09-22 |
| 📋 | Presto: A Decade of SQL Analytics at Meta | | 2024-07-04 |
| 📋 | Computation Reuse via Fusion in Amazon Athena | | 2024-11-18 |

🚀 Engineering Systems

| Status | Paper | Notes | Date Added |
| --- | --- | --- | --- |
| ✅ | MapReduce: Simplified Data Processing on Large Clusters | ✍️ | 2024-09-30 |
| 📋 | The Google File System | | 2024-09-30 |
| 📋 | Zanzibar: Google’s Consistent, Global Authorization System | | 2024-09-30 |

🌐 Blockchain / Cryptography

| Status | Paper | Notes | Date Added |
| --- | --- | --- | --- |
| 📖 | Bitcoin | | 2024-07-04 |
| 📋 | Ethereum | | 2024-09-22 |

🔒 Cyber Security

| Status | Paper | Notes | Date Added |
| --- | --- | --- | --- |
| 📋 | a_method_for_obtaining_digital_signatures_and_public_key_cryptosystems.pdf | | 2025-02-21 |
| 📋 | reflection_on_trusting_trust.pdf | | 2025-02-21 |
| 📋 | new_directions_in_cryptography.pdf | | 2025-02-21 |

📦 Miscellaneous

| Status | Paper | Notes | Date Added |
| --- | --- | --- | --- |
| ✅ | Advantages and Disadvantages of a Monolithic Repository | ✍️ | 2024-09-22 |
| 📋 | Bloom Filters: Design Innovations and Novel Applications | | 2024-07-04 |


Last updated: February 21, 2025