Note that schedule is subject to change
Need to be on campus, or use VPN to access some papers
Or, change "dl.acm.org/" to "dl-acm-org.proxy.library.cornell.edu/" in the URL
# |
Who (click for slides) |
Date |
Topic |
Required Reading |
Suggested Reading |
1 |
Hakim
(pdf pptx) |
8/23 | Course overview | ||
2 |
Hakim
(pdf pptx) |
8/28 | Building Large, Principled Systems |
End-to-end arguments
in system design, J.H. Saltzer, D.P. Reed, D.D. Clark. ACM
Transactions on Computer Systems Volume 2, Issue 4 (November 1984),
pages 277--288. Hints for computer system design, B. Lampson. Proceedings of the Ninth ACM Symposium on Operating Systems Principles (Bretton Woods, New Hampshire, United States) 1983, pages 33--48. |
The Impact of
Architectural Trends on Operating System Performance Rosenblum et al.
15th SOSP, 1995. Interposition Agents: Transparently Interposing User Code at the System Interface, Michael Jones. 14th SOSP, 1993, pages 80--93. |
3 |
Hakim
(pdf pptx) |
8/30 | Classic Systems |
The UNIX
time-sharing system, Dennis M. Ritchie and Ken Thompson.
Communications of the ACM Volume 17, Issue 7 (July 1974), pages
365--375. The structure of the "THE"-multiprogramming system, E.W. Dijkstra. Communications of the ACM Volume 11, Issue 5 (May 1968), pages 341--346. |
Using continuations
to implement thread management and communication in operating systems,
Richard P. Draves, Brian N. Bershad, Richard F. Rashid, and Randall W. Dean.
Proceedings of the Thirteenth ACM Symposium on Operating Systems
Principles (Pacific Grove, California, 1991), pages 122--136. The nucleus of a multiprogramming system, P, Brinch Hansen. Communications of the ACM Volume 13, Issue 4 (April 1970), pages 238--241. The Duality of Memory and Communication in the Implementation of a Multiprocessor Operating System, M. Young, A Tavanian, R. Rashid, D. Golub, and J. Eppinger. Proceedings of the Eleventh ACM Symposium on Operating Systems Principles (Austin, Texas, United States), ACM, 1987, pages 63--76. |
4 |
Hakim
(pdf pptx) |
9/4 | Bit Coin, Block Chains |
Required: Bitcoin: A
peer-to-peer electronic cash system. Nakamoto, Satoshi. Consulted
1.2012 (2008): 28. optional: Bitcoin and Cryptocurrency Technologies: A Comprehensive Introduction. A. Narayanan, J. Bonneau, E. Felten, A. Miller, S. Goldfeder. Princeton University Press; 2016. Optional: Majority is not enough: Bitcoin mining is vulnerable. E. Ittay, and E. Sirer. arXiv preprint arXiv:1311.0243 (2013). |
The Bitcoin Backbone Protocol: Analysis and Applications. Juan A. Garay, Aggelos Kiayias, Nikos Leonardos. Nov. 2014 (ArXiV).
Information Propagation in the Bitcoin Network. Christian Decker,
Roger Wattenhofer. 13th nal Conference on Peer-to-Peer Computing,
2013. |
5 |
Hakim
(pdf pptx) |
9/6 | Classic File Systems |
required: The Design and Implementation of a Log-Structured File System, Mendel Rosenblum and Ousterhout. Proceedings of the thirteenth ACM symposium on Operating systems principles, October 1991, pages 1--15. optional: A Fast File System for UNIX. Marshall K. McKusick, William N. Joy, Samuel J. Leffler, Robert S. Fabry. ACM TOCS 2(3), Aug 1984, pages 181--197. |
Soft Updates: A Solution to the Metadata Update problem in File Systems. Gregory R. Ganger, Marshall Kirk McKusick, Craig A. N. Soules, Yale N. Patt. ACM TOCS 18(2). May 2000, pages 127--153. The Zebra striped network file system, John H. Hartman and John K. Ousterhout. Proceedings of the fourteenth ACM symposium on Operating systems principles, 1993, pages 29--43. When to forget in the Elephant file system, Douglas S. Santry, Michael J. Feeley, Norman C. Hutchinson, Alistair C. Veitch, Ross W. Carton, and Jacob Ofir. Proceedings of the seventeenth ACM symposium on Operating systems principles, December 1999, pages 110--123. Towards weakly consistent local storage systems, Ji-Yong Shin, Mahesh Balakrishnan, Tudor Marian, Jakub Szefer, Hakim Weatherspoon. 5th ACM Symposium on Cloud Computing (SoCC), October 2016. |
6 |
Hakim
(pdf pptx) |
9/11 | Concurrency, Threads, and Events |
On the duality of operating system structures, H. C. Lauer and R. M. Needham. ACM SIGOPS Operating Systems Review Volume 12, Issue 2 (April 1979), pages 3--19. SEDA: An Architecture for Well Conditioned, Scalable Internet Services, Matt Welsch, David Culler, and Eric Brewer. Proceedings of the Eighteenth ACM Symposium on Operating Systems Principles (Banff, Alberta, Canada, 2001), pages 230--243. |
Goal-oriented programming, or composition using events,or threads considered harmful, Robbert Van Renesse. Proceeding of the 8th ACM SIGOPS European Workshop on Support for Composing Distributed Applications
(Sintra, Portugal, 1998), pages 82--87. Using Threads in Interactive Systems. Hauser et al. 14th SOSP, Dec 1993. An introduction to programming with threads, A. Birrell. Technical Report, Microsoft Research, May 2005. Capriccio: Scalable threads for internet services, Rob von Behren et al. Proceedings of the Nineteenth ACM Symposium on Operating Systems Principles (Bolton Landing, NY, 2003), pages 268--281. Events can make sense, Maxwell Krohn, Eddie Kohler, and M. Frans Kaashoek. Proceedings of the 2007 USENIX Annual Technical Conference
(Santa Clara, CA, June 2007), pages 87--107.
Using continuations to implement thread management and communication in operating systems, Richard P. Draves, Brian N. Bershad, Richard F. Rashid, and Randall W. Dean.
Proceedings of the Thirteenth ACM Symposium on Operating Systems Principles
(Pacific Grove, California, 1991), pages 122--136. Eraser: A Dynamic Data Race Detector for Multi-Threaded Programs, Savage, Stefan et al. ACM Transactions on Complete Systems (TOCS), 1997. |
7 |
Hakim
(pdf pptx) |
9/13 |
µ-Kernels |
Mach: A new kernel foundation for UNIX development, Mike Accetta, Robert Baron, William Bolosky, David Golub, Richard Rashid, Avadis Tevanian, and Michael Young. Proceedings of the USENIX Summer Conference, Atlanta, GA, 1986, pages 93--112. The Performance of µ-Kernel-based Systems. Härtig et al. 16th SOSP, Oct 1997. |
Operating system support for database management, Michael Stonebraker. Commununications of the ACM Vol 24, no. 7 (July 1981), pp. 412--418. The Duality of Memory and Communication in the Implementation of a Multiprocessor Operating System, M. Young, A Tavanian, R. Rashid, D. Golub, and J. Eppinger. Proceedings of the Eleventh ACM Symposium on Operating Systems Principles (Austin, Texas, United States), ACM, 1987, pages 63--76. EMERALDS: A Small-Memory Real-Time Microkernel, Zuberi et al. 17th SOSP, 1999. The Flux OSKit: A Substrate for OS and Language Research. Ford et al. 16th SOSP, Oct 1997. |
8 |
Hakim (pdf pptx) |
9/18 | Modern Systems: Extensible Kernels and Containers |
Required: Exokernel: an operating system architecture for application-level resource management, Dawson R. Engler, M. Frans Kaashoek, and James O'Toole, Jr. 15th ACM symposium on Operating systems principles (SOSP), December 1995, pages 251--266.Optional: Unikernels: library operating systems for the cloud, Anil Madhavapeddy, Richard Mortier, Charalampos Rotsos, David Scott, Balraj Singh, Thomas Gazagnaire, Steven Smith, Steven Hand, Jon Crowcroft. 18th ACM International Conference on Architectural support for programming languages and operating systems (ASPLOS), March 2014, pages 461--472. |
Extensibility, Safety and Performance in the SPIN Operating System, Brian N. Bershad, Stefan Savage, Przemyslaw Pardyak, Emin Gun Sirer, Marc E. Fiuczynski, David Becker, Craig Chambers, Susan Eggers. 15th ACM symposium on Operating systems principles (SOSP), December 1995, pages 267--283. Application performance and flexibility on Exokernel systems, Kaashoek et al. 16th SOSP, 1997. Microkernels meet Recursive Virtual Machines, Bryan Ford, Mike Hibler, Jay Lepreau, Patrick Tullmann, Godmar Back, Stephen Clawson. 2nd USENIX symposium on Operating systems design and implementation (OSDI), October 1996, pages 137-151. |
9 |
Hakim
(pdf pptx) |
9/25 |
Modern Systems: Virtualization |
The Origin of the VM/370 Time-Sharing System, R. J. Creasy, In IBM Journal of Research and Development, 25(5):483-490, September 1981. Xen and the Art of Virtualization, Paul Barham, Boris Dragovic, Keir Fraser, Steven Hand, Tim Harris, Alex Ho, Rolf Neugebauer, Ian Pratt, Andrew Warfield. 19th ACM symposium on Operating systems principles (SOSP), October 2003, pp. 164--177. |
Are Virtual Machine Monitors Microkernels Done Right?, Steven Hand,
Andrew Warfield, Keir Fraser, Evangelos Kotsovinos, Dan Magenheimer.
Proceedings of the Tenth Workshop on Hot Topics in Operating Systems (HotOS), Sante Fe, NM, June 2005. Are
Virtual Machine Monitors Microkernels Done Right?, Gernot Heiser,
Volkmar Uhlig, Joshua LeVasseur. ACM SIGOPS Operating Systems Review
(OSR), Volume 40, Issue 1, January 2006, pages 95--99.
Memory resource management in VMware ESX server, C. A. Waldspurger. OSDI
2002.
Virtual Machine Monitors: Current Technology and Future Trends, Mendel
Rosenblum, Tal Garfinkel. Computer, vol. 38, no. 5, pp. 39-47, May 2005. Operating
System Support for Virtual Machines, S. T. King, G. W. Dunlap, and P. M.
Chen. 2003 USENIX Technical Conference.
The Turtles Project: Design and Implementation of Nested Virtualization,
Ben-Yehuda, M., Day, M.D., Dubitzky, Z., Factor, M., Har'El, N., Gordon, A., Liguori, A., Wasserman, O. and Yassour, B.A.
In OSDI, vol. 10, pp. 423-436. 2010. |
10 |
Hakim
(pdf pptx) |
9/27 |
Modern Systems: Multicore issues |
Required: Disco: Running
Commodity Operating Systems on Scalable Multiprocessors, Edouard
Bugnion, Scott Devine, and Mendel Rosenblum. 16th ACM symposium on
Operating systems principles (SOSP), October 1997, pages 143--156. Optional: The Multikernel: A new OS architecture for scalable multicore systems. Andrew
Baumann, Paul Barham, Pierre-Evariste Dagand, Tim Harrisy, Rebecca Isaacs, |
Thousand core chips: a technology perspective. S. Borkar. In
Proceedings of the 44th Annual Design Automation Conference, pages.
746–749, 2007. Corey: An
operating system for many cores. S. Boyd-Wickizer, H. Chen, R. Chen, Y.
Mao, F. Kaashoek, R. Morris, A. Pesterev, L. Stein, M. Wu, Y. Dai, Y. Zhang,
and Z. Zhang. In Proceedings of the 8th Tornado: maximizing locality and concurrency in a shared memory multiprocessor operating system, Ben Gamsa, Orran Krieger, Jonathan Appavoo, and Michael Stumm. 3rd USENIX symposium on Operating systems design and implementation (OSDI), February 1999, pages 87-100. |
11 |
Deepak
(pdf pptx) |
10/1 |
Modern Systems: Security |
Required:Shielding Applications from an Untrusted Cloud with Haven, Andrew Baumann and Marcus Peinado and Galen Hunt. In the 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI). October 2014, pp. 267--283, Broomfield, CO. Optional: Logical Attestation: An Authorization Architecture For Trustworthy Computing. Emin Gün Sirer, Willem de Bruijn, Patrick Reynolds, Alan Shieh, Kevin Walsh, Dan Williams, and Fred B. Schneider. In Proceedings of the Symposium on Operating Systems Principles, Cascais, Portugal, October 2011. |
Software Guard Extensions (SGX) Fabric: A Platform for Secure Distributed Computation and Storage.
Jed Liu, Michael D. George, K. Vikram, Xin Qi, Lucas Waye, and Andrew C.
Myers. In ACM Symposium on Operating Systems, 2009. Pages
321-334. |
12 |
Jasleen
(pdf pptx) |
10/4 |
Storage Systems: Cloud-Scale Storage |
Required:The Google file system, Sanjay Ghemawat, Howard Gobioff, Shun-Tak Leung. 19th ACM symposium on Operating systems principles (SOSP), October 2003, 29--43. Optional:Spanner: Google's Globally Distributed Database, James C. Corbett, Jeffrey Dean, Michael Epstein, Andrew Fikes, Christopher Frost, J. J. Furman, Sanjay Ghemawat, Andrey Gubarev, Christopher Heiser, Peter Hochschild, Wilson Hsieh, Sebastian Kanthak, Eugene Kogan, Hongyi Li, Alexander Lloyd, Sergey Melnik, David Mwaura, David Nagle, Sean Quinlan, Rajesh Rao, Lindsay Rolig, Yasushi Saito, Michal Szymaniak, Christopher Taylor, Ruth Wang, and Dale Woodford. In Proceedings of the 10th USENIX conference on Operating Systems Design and Implementation (OSDI'12), October 2012, 251--264. |
RACS: A Case for Cloud Storage Diversity, Hussam Abu-Libdeh, Lonnie Princehouse, Hakim Weatherspoon, In Proceedings of the First ACM Symposium on Cloud Computing (SOCC), June 2010. Finding a Needle in Haystack: Facebook's Photo Storage. Doug Beaver, Sanjeev Kumar, Harry C. Li, Jason Sobel, and Peter Vajgel. OSDI 2010. Rethink the Sync, Edmund B. Nightingale, Kaushik Veeraraghavan, Peter M. Chen, and Jason Flinn. Proceedings of the 7th USENIXE Symposium on Operating Systems Design and Implementation (OSDI), November 2006. Speculative Execution in a Distributed File System, Edmund B. Nightingale, Peter M Chen, Jason Flinn. Proceedings of the 20th ACM Symposium on Operating Systems Principles (SOSP), October 2005, pages 191--205. |
13 |
Hakim
(pdf pptx) |
10/4 | Storage Systems: Big Data Infrastructure |
Required:MapReduce: Simplified Data Processing on Large Clusters.Jeffrey Dean and Sanjay Ghemawat. (OSDI, 2004). December, 2004.
Optional: Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing. Matei Zaharia, Mosharaf Chowdhury, Tathagata Das, Ankur Dave, Justin Ma, Murphy McCauley, Michael J. Franklin, Scott Shenker, Ion Stoica. (NSDI'12), April, 2012. |
Apache Hadoop YARN: yet another resource negotiator, Vinod Kumar Vavilapalli, Arun C. Murthy, Chris Douglas, Sharad Agarwal, Mahadev Konar, Robert Evans, Thomas Graves, Jason Lowe, Hitesh Shah, Siddharth Seth, Bikas Saha, Carlo Curino, Owen O'Malley, Sanjay Radia, Benjamin Reed, and Eric Baldeschwieler. 2013. Apache Hadoop YARN: yet another resource negotiator. In Proceedings of the 4th annual ACM Symposium on Cloud Computing (SoCC). Article 5, 16 pages, October 2013. |
Fall Break: Oct 6-9 |
|||||
14 |
Alison
(pdf pptx) |
10/16 |
Distributed Systems: Ordering and Consistent Cuts |
Required:
Time, Clocks, and the Ordering of Events in a Distributed System,
Lamport. CACM 21(7). July 1978. Optional: Distributed snapshots: Determining global states of distributed systems. Chandy, Lamport. ACM TOCS 3(1), 1985, 63-75. |
How Processes Learn. K. Mani Chandy and Jay Misra. In Proceedings of the fourth annual ACM symposium on Principles of distributed computing (PODC '85). ACM, 204-214 |
15 |
Chinasa
(pdf pptx) |
10/18 |
Distributed Systems: Putting theory into practice |
Implementing
fault-tolerant services using the state machine approach: A tutorial,
Fred Schneider. ACM Computing Surveys Volume 22, Issue 4 (December
1990), 299--319. Chain replication for high throughput and availability. Robbert van Renesse and Fred Schneider. OSDI 2004. |
Using Time Instead of Timeout for Fault-Tolerant Distributed Systems,
Lamport. ACM TOPLAS 6:2, 1974. Dangers of Replication and a Solution, Gray et al. ACM SIGMOD, Jun 1996. |
16 |
|
Distributed Systems: More atomicity options: group communication, replicated atomic objects.Eric Brewer's CAP conjecture is discussed in the extra materials but we won't get very into CAP in the class itself. |
Required:
The Process Group Approach to Reliable Distributed Computing, Birman.
CACM, Dec 1993, 36(12):37-53. Optional: Bimodal Multicast, Birman et al. ACM TOCS 17(2), May 1999. |
CAP Twelve Years Later (and another version here). Eric Brewer. IEEE Computer Magazine (special
issue on “The Growing Impact of the CAP Theorem”). Vol 45, Issue 2, February 2012, pp. 23-30. Perspectives on the CAP Theorem (and another version here). Seth Gilbert and Nancy Lynch. IEEE Computer Magazine, Vol. 45, Issue 2, February 2012, pp. 30--36.
Overcoming CAP with Consistent Soft-State Replication (and another version here).
Kenneth P. Birman, D. Freedman, Q. Huang and Patrick Dowell. IEEE Computer
Magazine (special issue on “The Growing Impact of the CAP Theorem”). Vol. 45, Issue 2, February 2012, pp. 50-58. isis2.codeplex.com: Isis2, A modern, open-source, group communication library for cloud computing applications. Ken Birman, 2011.
Sinfonia: A new paradigm for building scalable distributed systems.
Marcos K. Aguilera, Arif Merchant, Mehul Shah, Alistair Veitch, Christos
Karamanolis. November 2009 Transactions on Computer Systems (TOCS), Volume
27 Issue 3 |
|
17 |
Haobin
(pdf pptx) |
10/23 |
Distributed Systems: Paxos |
Required:Paxos
Made Moderately Complex. Robbert van Renesse and Deniz Altinbuken. ACM Computing Surveys. Vol 47, No. 3, Article 42, 36 pages, February 2015. Required:Wiki:
Paxos_Protocol |
The part-time parliament, Leslie Lamport. ACM Transactions on Computer Systems (TOCS), Vol. 16, No. 2, May 1998, pp. 133–169.
Paxos Made Simple, Leslie Lamport. ACM SIGACT NEWS, Vol. 32, No. 4, December 2001. CORFU: A Distributed Shared Log, Mahesh Balakrishnan, Dahlia Malkhi, John Davis, Vijayan Prabhakaran, Michael Wei, and Ted Wobber, ACM Transactions on Computer Systems (TOCS), ACM, 2013. |
18 |
Siqiu
(pdf pptx) |
10/25 |
Distributed Systems: Byzantine Agreement |
Required:
The Byzantine Generals Problem, Lamport et al. ACM TOPLAS 4, 1982. Optional Easy Impossibility Proofs for Distributed Consensus Problems. Michael J. Fischer , Nancy A. Lynch , Michael Merritt. ACM PODC 1986. |
Practical Byzantine Fault Tolerance, Miguel Castro and Barbara Liskov. USENIX Operating Systems Design and Implementation (OSDI), February 1999, pages 173--186. Randomized Byzantine Generals, Rabin. FOCS, 1983. Byzantine Quorum Systems, Malkhi and Reiter.
Fault-Scalable Byzantine Fault-Tolerant Services, Michael Abd-El-Malek
et.al. SOSP 2005. The Next 700 BFT Protocols. Rachid Guerraoui, Nikola Knezevic, Vivien Quéma and Marko Vukolić. In Proceedings of EuroSys, Paris, France, pp. 363-376, April 2010 |
19 | Julia | 10/30 |
Distributed Systems: FLP |
Required:Impossibility of Distributed Consensus with One Faulty Process, Fisher
et al. JACM 32(2), Apr 1985. Optional:The weakest failure detector for solving consensus, Chandra et al. J. ACM 43, 4, Jul. 1996. |
Effectively nonblocking consensus procedures can execute forever-a constructive version of flp, Robert Constable. arXiv preprint arXiv:1109.3370, September 2011. The Building Blocks of Consensus. Yee Jiun Song, Robbert van Renesse, Fred B. Schneider, Danny Dolev. The 9th International Conference on Distributed Computing and Networking (ICDCN 08), January, 2008. LNCS, Vol. 4904, pp. 54-72. |
20 | Xinwen | 11/1 |
P2P Systems: Gossip |
Required:Epidemic algorithms
for replicated database maintenance, Alan Demers, Dan Greene, Carl
Hauser, Wes Irish, John Larson, Scott Shenker, Howard Sturgis, Dan
Swinehart, Doug Terry. Appears in 6th ACM Symposium on Principles of
distributed computing (PODC), August 1987, pages 1--12. Optional:Managing update conflicts in Bayou, a weakly connected replicated storage system, Doug B. Terry, Marvin M. Theimer, Karin Petersen, Alan J. Demers, Mike J. Spreitzer, and Carl H. Hauser. In Proceedings of the 5th ACM Symposium on Operating Systems Principles (SOSP), December 1995, pages 172--182. |
Astrolabe: A Robust and Scalable Technology for Distributed System Monitoring, Management, and Data Mining, Robbert Van Renesse, Kenneth P. Birman, Werner Vogels. Appears in ACM Transactions on Computer Systems (TOCS), Volume 21, Issue 2, May 2003, pages 164--206. T-Man: Fast
Gossip-based Construction of Large-Scale Overlay Topologies. Mark
Jelasity Ozalp Babaoglu. Technical Report UBLCS-2004-7. May 2004 Bayesian inference
using data flow analysis. Claret, Guillaume, et al.
Proceedings of the 2013 9th Joint Meeting on Foundations of Software
Engineering. ACM, 2013. Probabilistic programming. IGordon, A. D., Henzinger, T. A., Nori, A. V., & Rajamani, S. K. May 2014, May. International Conference on Software Engineering (ICSE, FOSE track). |
21 | Darian | 11/6 |
P2P Systems: DHT |
Chord: A scalable peer-to-peer lookup service for internet applications, Ion Stoica, Robert Moris, David Karger, M. Frans Kaashoek, and Hari Balakrishnan. Proceedings of the ACM SIGCOMM, August 2001, 149--160. San Diego, California, United States.
The Impact of DHT Routing Geometry on Resilience and Proximity, Krishna Gummadi , Ramakrishna Gummadi , Steve Gribble , Sylvia Ratnasamy , Scott Shenker, Ion Stoica. Appears in Proceedings of ACM SIGCOMM, August 2003, pages 381--3. |
SplitStream: high-bandwidth multicast in cooperative environments.
Castro, et. al. SOSP 2003. Overcast: Reliable Multicasting with an Overlay Network, Jannotti et al. 4th OSDI, Dec 2000.
|
22 | Alexa | 11/8 | P2P Systems: Storage |
Required:
Dynamo: Amazon's Highly Available Key-Value Store, Giuseppe DeCandia,
Deniz Hastorun, Madan Jampani, Gunavardhan Kakulapati, Avinash Lakshman,
Alex Pilchin, Swaminathan Sivasubramanian, Peter Vosshall, and Werner
Vogels. In Proceedings of twenty-first ACM SIGOPS symposium on
Operating systems principles (SOSP). October 2007, pp. 205--220. Optional:Cassandra: a decentralized structured storage system, Avinash Lakshman and Prashant Malik, SIGOPS Oper. Syst. Rev. 44, 2 (April 2010) |
Pond: the OceanStore Prototype, Sean Rhea, Patrick Eaton, Dennis Geels, Hakim Weatherspoon, Ben Zhao, and John Kubiatowicz. In Proceedings of the 2nd USENIX Conference on File and Storage Technologies (FAST), March 2003, pages 1--14. Storage management and caching in PAST, a large scale, persistent peer-to-peer storage utility, Rowstron and Druschel. 18th SOSP, Oct 2001. Wide-area cooperative storage with CFS, Frank Dabek, M. Frans Kaashoek, David Karger, Robert Morris, and Ion Stoica. In Proceedings of the ACM Symposium on Operating Systems Principles (SOSP), October 2001, pages 202--215. A Distributed Decentralized Information Storage and Retrieval System, Ian Clark. University of Edinburgh, 1999 (Original unpublished technical report on Freenet). Why Gnutella can't scale, no really?, Ritter, Feb 2001. |
23 | Gloire | 11/13 | Networked Systems: TCP |
Congestion Avoidance and Control, Van Jacobson. Appears in Proceedings of ACM SIGCOMM, Vo1ume 18, Number 4, (August 1988). TCP Congestion Control with a Misbehaving Receiver, Stefan Savage, Neal Cardwell, David Wetherall and Tom Anderson, Appears in ACM SIGCOMM Computer Communication Review, Volume 29 , Issue 5 (October 1999), pages 71--78. |
RouteBricks:
Exploiting Parallelism To Scale Software Routers Mihai Dobrescu and Norbert Egi, Katerina Argyraki, Byung-Gon Chun, Kevin Fall, Gianluca Iannaccone, Allan Knies, Maziar Manesh, Sylvia Ratnasamy 22nd ACM Symposium on Operating Systems Principles (SOSP), October 2009 Routers for the Cloud. Can the Internet Achieve 5-Nines Availability? Andrei Agapi, Ken Birman, Robert Broberg, Chase Cotton, Thilo Kielmann, Martin Millnert, Rick Payne, Robert Surton, and Robbert VanRenesse. IEEE Internet Computing. Volume 15. Issue 5. pp.72 - 77. September, October 2011. |
24 | Daniel |
11/20 |
Networked Systems: Kernel-bypass/RDMA |
Required: U-Net: A User-Level
Network Interface for Parallel and Distributed Computing, Von Eicken,
Basu, Buch and Werner Vogels. 15th SOSP, December 1995. Optional:
FaRM: Fast Remote Memory, Aleksandar Dragojević, Dushyanth
Narayanan, Orion Hodson, and Miguel Castro; Microsoft Research |
Evaluation of the Virtual Interface Architecture (VIA). Towards
an active network architecture, David L. Tennenhouse and David J.
Wetherall. Appears in ACM SIGCOMM Computer Communication Review (CCR),
Volume 37, Issue 5 (October 2007), pages 81--94.
A
Survey of Active Network Research, David L. Tennenhouse, Jonathan M.
Smith, W. David Sincoskie, David J. Wetherall, and Gary J. Minden. Appears
in IEEE Communications Magazine, Volume 35, Issue 5 (October 1997),
pages 80--86. |
Thanksgiving Holiday: Nov 21-Nov 25 |
|||||
25 | Kevin |
11/27 |
Networked Systems: Kernel-bypass/RDMA |
Required:
Arrakis: The Operating System is the Control Plane. Simon
Peter, Jialin Li, Irene Zhang, Dan R. K. Ports, Doug Woos, Arvind
Krishnamurthy, Optional: IX: A Protected Dataplane Operating System for High Throughput and Low Latency. Adam Belay, George Prekas, Ana Klimovic, Samuel Grossman, and Christos Kozyrakis, Edouard Bugnion. OSDI 2014. |
Profiling a
warehouse-scale computer. Svilen Kanev, et al.
ISCA 2015. Slides.
This paper basically says that 90% of potential
compute performance in Google's main data center is lost
in hardware stalls of various kinds!
High performance RDMA-based MPI implementation over InfiniBand. Liu,
Jiuxing, Jiesheng Wu, and Dhabaleswar K. Panda. International Journal
of Parallel Programming 32.3 (2004): 167-198. WireDirect Enables New Benchmarks For Low Latency UDP. Chelsio white paper. |
26 |
11/29 |
Networked Systems: SDN |
OpenFlow:
Enabling innovation in campus networks. Nick McKeown et al. (2008-04).
ACM Communications Review. Required: The Road to SDN: An Intellectual History of Programmable Networks. Nick Feamster, Jennifer Rexford, and Ellen Zegura. Appears in ACM SIGCOMM Computer Communication Review, Volume 44 , Issue 2 (April 2014), pages 87--98. |
Frenetic: A High-Level Langauge for OpenFlow Networks. Nate Foster, Rob Harrison, Matthew L. Meola, Michael J. Freedman, Jennifer Rexford, and David Walker. In ACM Workshop on Programmable Routers for Extensible Services of Tomorrow (PRESTO), Philadelphia, PA, November 2010. Abstractions for Network Update. Mark Reitblatt, Nate Foster, Jennifer Rexford, Cole Schlesinger, and David Walker. In ACM SIGCOMM Conference, Helsinki, Finland, August 2012. |
|
27 | 11/29 | Networked Systems: SDN and P4 |
Optional: NetCache: Balancing Key-Value Stores with Fast In-Network Caching, Xin Jin, Xiaozhou Li, Haoyu Zhang, Robert Soulé, Jeongkeun Lee, Nate Foster, Changhoon Kim, and Ion Stoica. 26th ACM symposium on Operating systems principles (SOSP), October 2017, pages 121--136. |
||
|
|
12/4 |
Final Presentations and (if relevant) demos |