This semester I’m the faculty chair of the AU CS Colloquium, and due to it being completely online this semester I decided to open it up to the wider community. So I’ll be doing some advertising here to get y’all to come and attend some great presentations.
The complete schedule can be found here:
Topics will vary and will include PL, category theory, security, and diversity in CS this semester.
If you would like to attend please RSVP to me.
The first presentation is by a NEW AU faculty member, Dr. Edward Tremel:
Title: Dependable Systems for Managing Valuable Data
Distributed computing systems have become an essential piece of infrastructure for today’s businesses, organizations, and governments; they store, manage and interpret vast amounts of data, and allow groups of people to communicate and take coordinated actions. With the emergence of Internet-of-Things devices, distributed systems are also responsible for monitoring and automating the physical world. As a result, distributed systems face competing demands: they must be dependable enough to be relied upon for essential tasks, but responsive enough to handle requests with tight time constraints.
My research focuses on building distributed systems that provide dependability guarantees while still maintaining high performance. In this talk I will describe my work on one of these systems, Derecho, which is a library for building replicated datacenter applications. Derecho allows distributed services to increase their fault-tolerance and scalability by organizing servers into replicated shards, while achieving incredibly high data throughput and low response latency. It accomplishes this by redesigning state-machine-replication and fault-management protocols to use non-blocking, asynchronous communication, and separating data flow from control messages. Derecho also includes optional mechanisms for making services durable by logging state to persistent storage, and in the second half of my talk I will describe my work on a distributed restart algorithm for Derecho and other durable replicated services. This algorithm efficiently restarts a sharded, replicated service from state stored in persistent logs, while respecting node placement constraints and ensuring the state is mutually consistent across all subsystems.
More details can be found here.
Leave a comment by sending me an Email