15. Dynamic Programming, Part 1: SRTBOT, Fib, DAGs, Bowling

00:57:18
https://www.youtube.com/watch?v=r4-cftqTcdI

Summary

TL;DR: This lecture opens a new section of the course on algorithmic design, focused on the powerful and widely used paradigm of dynamic programming (DP). It presents the basic DP methodology via a template called SRTBOT, which structures the design of recursive algorithms. Using the Fibonacci sequence, the lecture shows how memoization turns an exponential-time recursion into a polynomial-time algorithm. Basic concepts such as recursion and memoization are clarified before moving on to harder problems. The session recasts merge sort and shortest paths in a DAG within the dynamic programming framework, then explores a new problem, modeled as a custom bowling game, to illustrate how DP finds optimal strategies under constraints. The lecture emphasizes the challenges of subproblem design and proposes sequence-based decompositions such as prefixes, suffixes, and substrings. The intended takeaway is a framework for tackling a wide variety of algorithmic problems systematically using dynamic programming.
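
As a concrete illustration of the memoization idea summarized above, here is a minimal Python sketch (ours, not code from the lecture) contrasting the naive Fibonacci recursion with its memoized version:

    # Naive recursion: repeated subproblems make this take exponential time.
    def fib_naive(n):
        if n <= 2:                 # base cases F(1) = F(2) = 1
            return 1
        return fib_naive(n - 1) + fib_naive(n - 2)

    # Memoized recursion: each subproblem is solved at most once,
    # so computing F(n) costs only O(n) additions.
    def fib_memo(n, memo=None):
        if memo is None:
            memo = {}
        if n in memo:              # reuse a remembered solution
            return memo[n]
        if n <= 2:
            result = 1
        else:                      # recurrence F(n) = F(n-1) + F(n-2)
            result = fib_memo(n - 1, memo) + fib_memo(n - 2, memo)
        memo[n] = result           # remember it for later reuse
        return result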

Takeaways

  • 🔍 Dynamic programming is focused on breaking problems into smaller subproblems and solving each just once.
  • 🧮 SRTBOT template is used for designing recursive algorithms based on dynamic programming.
  • 🚀 Memoization can drastically improve algorithm efficiency by remembering past computations.
  • 📈 Fibonacci sequence is a classic problem to demonstrate dynamic programming and memoization.
  • 🛤️ DAG shortest paths can be solved using dynamic programming techniques.
  • 🎳 Custom bowling game problem showcases practical applications of dynamic programming.
  • 💡 Using sequence-based decompositions helps in defining subproblems for dynamic programming.
  • 🔄 Recursive algorithms can be transformed into more efficient ones using dynamic programming.
  • ⚖️ Correct dynamic programming usage should yield polynomial time solutions.
  • 🧠 Efficient subproblem formulation is crucial in dynamic programming.

Timeline

  • 00:00:00 - 00:05:00

    Erik Demaine introduces a new section of the course focusing on algorithmic design, particularly dynamic programming, a recursive algorithmic design paradigm. This approach allows solving complex problems by writing constant-sized code, independent of problem size, through recursion or loops. The class uses proof-by-induction and subproblem graphs to discuss algorithm design, introducing an acronym SRTBOT (Sub-problems, Relations, Topological order, Base case, Original problem, Time) to help remember the steps in recursive algorithm design. Dynamic programming builds on this with memoization, reusing previous work.

  • 00:05:00 - 00:10:00

    To avoid infinite loops in recursive calls, the dependencies between subproblems must be acyclic, which is ensured by specifying a topological order. Base cases anchor the recursion, and the goal is to solve the original problem in terms of the subproblems. The course explores dynamic programming through examples, starting with merge sort, even though it is a divide-and-conquer algorithm: its subproblems are sorting sub-arrays, its relation is the linear-time merge, its topological order is increasing sub-array length, and its running time follows from the familiar merge sort recurrence.

  • 00:10:00 - 00:15:00

    Demaine uses the Fibonacci numbers as a toy problem to introduce memoization, a key concept in dynamic programming. Computing Fibonacci recursively without memoization takes exponential time because of repeated calculations; with memoization, subproblem results are cached and reused, reducing the work to a linear number of additions. Drawing the recursion tree, Demaine shows how rewriting the recursion to check the cache before recursing dramatically cuts down the computation.

  • 00:15:00 - 00:20:00

    A detailed walkthrough shows how memoization takes the Fibonacci recursion from exponential to linear time: each subproblem is solved at most once, so the total work is the sum of the non-recursive work over all subproblems. Demaine also revisits the usual word-RAM assumption that additions take constant time, noting that Fibonacci numbers need Θ(n) bits to write down, so the true cost is polynomial but slightly larger. Applying the same subproblem-counting analysis to merge sort yields only a loose O(n³) bound, since merge sort has no repeated subproblems and gains nothing from memoization.

  • 00:20:00 - 00:25:00

    The merge sort example shows that the subproblem-counting bound can be loose when there is no subproblem reuse, underscoring that dynamic programming pays off only where reuse exists. Demaine then introduces the DAG (directed acyclic graph) shortest paths problem, where a recursive relation over each vertex's incoming edges, combined with memoization, recovers the efficient algorithm from the earlier shortest paths lecture: summing in-degrees over all vertices gives O(|V| + |E|) time, far better than naive recursion.

  • 00:25:00 - 00:30:00

    Demaine characterizes dynamic programming as recursive computation with memoization and observes that the generic memoized recursion implicitly performs a depth-first search of the subproblem graph, with the memo table playing the role of the "have I visited this vertex" check in DFS. For DAG shortest paths, this means the algorithm never needs to compute a topological order explicitly; the recursion discovers it automatically, for free.

  • 00:30:00 - 00:35:00

    Bowling is introduced as a fresh computational problem to showcase dynamic programming in practice. In this made-up variant, n pins stand on a line and a ball thrown from infinitely far away can hit either one pin or two adjacent pins. Demaine explains the setup, rules, and scoring, presenting the problem as a sequential challenge well suited to sequence-based subproblem design.

  • 00:35:00 - 00:40:00

    In this game, based on a 1908 puzzle, each pin i carries a value v_i: hitting one pin scores v_i points, and hitting two adjacent pins i and i+1 scores the product v_i · v_(i+1). The goal is to maximize the total score, and not every pin must be hit, so the heart of the problem is deciding which pins to hit alone, which to hit in pairs, and which to skip entirely.

  • 00:40:00 - 00:45:00

    He applies the SRTBOT methodology to the bowling problem, breaking the row of pins into suffix subproblems. For each pin, local brute force considers the possible actions (skip it, hit it alone, or hit it together with its neighbor), and the recurrence takes the best of these options. Because the number of subproblems is linear and each is solved only once, the approach avoids exponential blowup.

  • 00:45:00 - 00:50:00

    The explanation extends to implementing dynamic programming solutions both recursively (top-down with a memo table) and iteratively (bottom-up), where an explicit loop fills the table in topological order; a short sketch of the bottom-up style follows this timeline. Demaine links the theoretical template to everyday coding practice, noting that simple conditional logic and max/min expressions usually suffice once the subproblems are chosen well.

  • 00:50:00 - 00:57:18

    In conclusion, he reiterates dynamic programming's strength: it tames apparently exponential search spaces by organizing computation around a polynomial number of sequence-based subproblems whose solutions are remembered and reused. Upcoming lectures promise more examples that deepen understanding and broaden the applications of this framework.
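
For the bottom-up implementation style mentioned in the 00:45:00 segment above, here is a minimal sketch (ours) using the Fibonacci example: the same subproblems, relation, and base cases as the memoized recursion, but a loop fills the table in topological order.

    # Bottom-up DP: the for loop replaces recursion, visiting
    # subproblems in topological order (increasing i).
    def fib_bottom_up(n):
        memo = {1: 1, 2: 1}                    # base cases
        for i in range(3, n + 1):              # topological order
            memo[i] = memo[i - 1] + memo[i - 2]
        return memo[n]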



Frequently Asked Questions

  • What is dynamic programming?

    Dynamic programming is an algorithmic design paradigm focused on solving complex problems by breaking them down into smaller, simpler subproblems, solving each of those subproblems just once, and storing their solutions, typically via a recursive approach enhanced by memoization.

  • What is the SRTBOT template?

    SRTBOT stands for Sub-problems, Relations, Topological order, Base case, Original problem, and Time; it serves as a template for designing recursive algorithms and, with memoization added, dynamic programs.

  • What is memoization in dynamic programming?

    Memoization is a technique used in dynamic programming to cache the results of expensive function calls and return the cached result when the same inputs occur again.

  • How does dynamic programming transform a recursive algorithm?

    The simplest transformation involves checking if a sub-problem has already been solved using memoization and reusing the result instead of re-computing it, thus avoiding redundant calculations.

  • Which example was used to demonstrate memoization?

    The Fibonacci sequence was used to illustrate memoization: the naive recursive solution takes exponential time, but applying memoization brings it down to polynomial time (in fact, a linear number of additions).

  • Can dynamic programming be applied to shortest path problems?

    Yes. In a directed acyclic graph (DAG), the shortest-path weight to each vertex satisfies a recursive relation (a minimum over its incoming edges), so single-source shortest paths can be computed as a dynamic program.

  • What real-world-like problem was discussed that uses dynamic programming?

    The lecture discusses a bowling variant where each pin has its own value and the goal is to maximize the score under simple rules, showcasing a practical application of dynamic programming.

  • What is the time complexity difference in solving Fibonacci with and without memoization?

    The running time for the naive recursive Fibonacci sequence calculation is exponential, while with memoization applied, it becomes linear.

  • What is the expected time complexity when using dynamic programming correctly?

    Polynomial time. Each subproblem is solved at most once, so the total time is bounded by the number of subproblems times the non-recursive work per subproblem; when both factors are polynomial, so is the whole algorithm.

  • What are common sub-problem formulations in dynamic programming?

    Prefixes and suffixes are generally preferred because they yield only a linear number of subproblems, while substring decompositions yield quadratically many but provide the finer granularity needed for some harder problems (counts summarized below).
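
For an input sequence x of length n, the standard counts for these decompositions (as used throughout this lecture series) are:

    # prefixes  x[:i]   = Θ(n)
    # suffixes  x[i:]   = Θ(n)
    # substrings x[i:j] = Θ(n²)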

Transcript
  • 00:00:00
    [SQUEAKING] [RUSTLING] [CLICKING]
  • 00:00:04
  • 00:00:12
    ERIK DEMAINE: All right, welcome back to 006.
  • 00:00:15
    Today we start a totally new section of the class.
  • 00:00:18
    Up till now, we've mostly been showing
  • 00:00:21
    you really cool and powerful algorithms, sorting algorithms,
  • 00:00:25
    graph algorithms, data structures, trees, lots
  • 00:00:28
    of good stuff that you can apply to solve tons
  • 00:00:30
    of algorithmic problems, either by reducing to the data
  • 00:00:34
    structures that we showed you, or reducing to graph problems
  • 00:00:38
    that we showed you, or by modifying
  • 00:00:40
    those algorithms a bit.
  • 00:00:41
    Today we're going to start a new section on algorithmic design--
  • 00:00:45
    how to, from scratch, come up with a polynomial time
  • 00:00:49
    algorithm to solve a problem.
  • 00:00:50
  • 00:00:52
    And in particular, we're going to talk
  • 00:00:54
    about an algorithmic design paradigm called
  • 00:00:57
    dynamic programming, which is extremely powerful.
  • 00:01:00
    It's probably the most powerful algorithmic design paradigm.
  • 00:01:02
    Very general.
  • 00:01:03
    Can solve lots of problems.
  • 00:01:06
    It's a particular type of recursive algorithm design.
  • 00:01:10
    And in general, this class--
  • 00:01:14
    all of algorithms-- is about recursive algorithm design
  • 00:01:17
    at some level, because we want to write
  • 00:01:19
    constant-sized pieces of code that solve
  • 00:01:22
    problems of arbitrary size.
  • 00:01:23
    We have some problem size n and we're
  • 00:01:25
    trying to write 100 lines of code or whatever,
  • 00:01:27
    some constant amount that doesn't depend on the problem
  • 00:01:30
    size.
  • 00:01:30
    We have one algorithm that solves
  • 00:01:32
    all instances of the problem.
  • 00:01:34
    And so we have to write code that is recursive or uses loops
  • 00:01:39
    or somehow reuses the instructions
  • 00:01:41
    that we give the computer.
  • 00:01:43
    And you may know you can convert any algorithm based
  • 00:01:46
    on loops into an algorithm using recursion.
  • 00:01:48
    And we're going to take the recursive view today,
  • 00:01:51
    in particular because it fits very
  • 00:01:53
    well with our proof-by-induction technique, which we've
  • 00:01:56
    used throughout this class, but also because it gives us
  • 00:02:00
    some structure on how different subproblems relate in something
  • 00:02:04
    called a subproblem graph, that we'll be talking about today.
  • 00:02:08
    And so we're going to start out with, in general, how do we
  • 00:02:11
    design recursive algorithms?
  • 00:02:13
    That's sort of the overall, encompassing everything.
  • 00:02:16
    We have thought very hard to come up
  • 00:02:18
    with a cool acronym for this paradigm which we invented
  • 00:02:21
    called SRTBOT--
  • 00:02:23
    thanks, Jason.
  • 00:02:25
    And so we'll talk-- it's not actually for sorting.
  • 00:02:27
    It's just an acronym for sub-problems, relations,
  • 00:02:33
    topological order, base case, original problem, and time.
  • 00:02:39
    But it's an acronym that will help you remember
  • 00:02:42
    all the steps you need in order to specify
  • 00:02:44
    a recursive algorithm.
  • 00:02:46
    And dynamic programming is going to build
  • 00:02:47
    on this template by adding one new idea called
  • 00:02:52
    memoization, which is just the idea of reusing work
  • 00:02:55
    that you've done before.
  • 00:02:57
    And that's going to let us solve tons of problems.
  • 00:03:00
    And let's see.
  • 00:03:02
    I don't-- let's get into it.
  • 00:03:05
    So we'll start out today with SRTBOT.
  • 00:03:08
    So here is SRTBOT down the column here.
  • 00:03:13
    This is a recursive algorithm design paradigm.
  • 00:03:16
    And in general, what we're going to do
  • 00:03:18
    is take the problem that we actually want to solve
  • 00:03:20
    and split it up into lots of possible sub problems.
  • 00:03:24
    And so the first part is to define what
  • 00:03:26
    the heck are the subproblems.
  • 00:03:27
    In general, we'll want some polynomial number of them.
  • 00:03:30
    But it's pretty open-ended what these look like.
  • 00:03:34
    And the hardest part, usually, in defining
  • 00:03:37
    a recursive algorithm is figuring out
  • 00:03:39
    what the sub problems should be.
  • 00:03:40
    Usually they're related to the problem you want to solve.
  • 00:03:42
    Often the problem you want to solve--
  • 00:03:45
    this is actually near the last step--
  • 00:03:46
    the original problem you're trying to solve
  • 00:03:48
    is often one of these sub problems.
  • 00:03:51
    And then you use the smaller sub problems
  • 00:03:53
    in order to build up the final, original problem.
  • 00:03:56
    But sometimes at the end, you need
  • 00:03:58
    to take a bunch of subproblems and combine them
  • 00:04:00
    into your original problem.
  • 00:04:01
    You can think-- one analogy you can think of here
  • 00:04:04
    is divide and conquer algorithms,
  • 00:04:06
    which also had this kind of style.
  • 00:04:09
    But more generally, we're going to relate different sub problem
  • 00:04:12
    solutions with some recursive structure-- some recurrence
  • 00:04:17
    relation.
  • 00:04:19
    This is just a recursive algorithm
  • 00:04:21
    that defines how to solve one problem in terms
  • 00:04:24
    of smaller sub-problems for some notion of smaller.
  • 00:04:28
    And this is given by the topological order.
  • 00:04:31
    So if we think of the subproblems as a graph
  • 00:04:34
    and we draw an edge between--
  • 00:04:37
    so the vertices of the graph are sub problems.
  • 00:04:40
    The edges are the dependencies between those subproblems.
  • 00:04:42
    Then what we'd like is the topological ordering,
  • 00:04:45
    the topological sort problem we talked about in the context
  • 00:04:48
    of DFS or DAG shortest paths.
  • 00:04:51
    What we would like is that the subproblems and the calls--
  • 00:04:56
    the recursive calls between them in this recursive relation--
  • 00:04:59
    forms a DAG.
  • 00:05:00
    We want it to be acyclic, otherwise
  • 00:05:02
    you have an infinite loop in your recursive calls.
  • 00:05:05
    If you have a cycle, you'll never terminate.
  • 00:05:09
    And so to make sure that these dependencies
  • 00:05:12
    between subproblems given by this recurrence relation
  • 00:05:15
    is acyclic, one way to do that is
  • 00:05:18
    to specify a topological order.
  • 00:05:20
    Or you could prove it some other way.
  • 00:05:22
    But often it's just a for loop to say, just do it
  • 00:05:25
    in this order.
  • 00:05:27
    Then of course any recursive structure needs base cases.
  • 00:05:31
    So that's a useful step not to forget.
  • 00:05:33
    We want to solve the original problem using these sub
  • 00:05:35
    problems.
  • 00:05:36
    And then we analyze a running time at the end.
  • 00:05:38
    So six easy steps.
  • 00:05:42
    Actually, the hardest ones are these two,
  • 00:05:43
    which are interrelated.
  • 00:05:45
    And what we're going to see over the next four lectures--
  • 00:05:49
    this is the first of four lectures
  • 00:05:50
    on dynamic programming-- is lots of examples of applying
  • 00:05:53
    this paradigm over and over together with the memoization
  • 00:05:56
    idea, which we'll get to soon.
  • 00:05:58
    Let's see an example first of an algorithm we've already seen,
  • 00:06:03
    which is merge sort, so a divide and conquer algorithm,
  • 00:06:05
    phrased with this structure of SRTBOT.
  • 00:06:09
    So for the sub problems--
  • 00:06:11
    so our original problem is to sort the elements of A.
  • 00:06:14
    And some sub-problems that we solve along the way are sorting
  • 00:06:18
    different sub-arrays of A. So for every--
  • 00:06:21
    well, not for every i and j, but for some i and js,
  • 00:06:24
    we sort the items from i up to j minus 1.
  • 00:06:28
    So I'm going to define that subproblem to be s of ij.
  • 00:06:32
    So this is something that I might want to solve.
  • 00:06:34
    The original problem that I want to solve
  • 00:06:37
    is s of 0 comma n, where n is the length of the array.
  • 00:06:40
    So that's what I actually care about in the end.
  • 00:06:42
    But we're going to solve that by writing it recursively
  • 00:06:45
    in terms of sorting different sub-arrays as follows.
  • 00:06:49
    This is the recurrence relation.
  • 00:06:51
    I've written it very simply here.
  • 00:06:53
    Of course, there's a merge algorithm,
  • 00:06:55
    which is somewhat complicated.
  • 00:06:57
    But as we saw the two finger linear time merge algorithm,
  • 00:07:00
    given two sorted arrays--
  • 00:07:04
    so this is supposed to be the sorted array
  • 00:07:06
    version of the items i through m. m
  • 00:07:09
    is the middle element between i and j
  • 00:07:11
    and the sorted array of the items from m up to j.
  • 00:07:15
    If we merge those, that gives us the sorted array
  • 00:07:18
    from i up to j.
  • 00:07:21
    And that's exactly what merge sort does.
  • 00:07:24
    So in general, this relation is just
  • 00:07:28
    some algorithm for if you're given the solutions
  • 00:07:33
    to some smaller subproblems, how do I solve the subproblem
  • 00:07:38
    that I want to solve?
  • 00:07:41
    And so we need to make sure that this problem is bigger
  • 00:07:46
    than the ones that we recursively call on
  • 00:07:48
    and that we don't get an infinite cyclic loop
  • 00:07:51
    of recursions.
  • 00:07:51
    And here our valid topological order
  • 00:07:53
    is to say, solve these problems in order
  • 00:07:57
    where j minus i-- the length of the sub-array-- is increasing.
  • 00:08:02
    And then you can check because m is strictly between i and j.
  • 00:08:05
    As long as we're not in a base case, then we know we can--
  • 00:08:11
    these subarrays will be smaller than this one.
  • 00:08:14
    And so this increasing order gives us
  • 00:08:15
    a valid topological order on all of the problems, all
  • 00:08:19
    the subproblems.
  • 00:08:20
    We have a base case, which is if we
  • 00:08:22
    don't want to sort anything, that's the empty array,
  • 00:08:25
    or at least in the original problem.
  • 00:08:27
    And then running time is--
  • 00:08:29
    I mean, there's no better way to solve it than the recurrence
  • 00:08:32
    that we already saw how to solve.
  • 00:08:34
    So this is just another way to think of n log n
  • 00:08:36
    merge sort in this labeled framework of SRTBOT.
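
A short Python sketch (ours) of merge sort in the S(i, j) form just described, where S(A, i, j) returns the sorted version of A[i:j]:

    # Relation: S(i, j) = merge(S(i, m), S(m, j)) for m between i and j.
    def merge(left, right):
        # two-finger linear-time merge of two sorted lists
        out, i, j = [], 0, 0
        while i < len(left) and j < len(right):
            if left[i] <= right[j]:
                out.append(left[i]); i += 1
            else:
                out.append(right[j]); j += 1
        return out + left[i:] + right[j:]

    def S(A, i, j):
        if j - i <= 1:             # base case: at most one element
            return A[i:j]
        m = (i + j) // 2           # m is strictly between i and j
        return merge(S(A, i, m), S(A, m, j))

    # Original problem: S(A, 0, len(A))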
  • 00:08:41
    Let's get to another problem that
  • 00:08:44
    does not fit recursion so well.
  • 00:08:48
    But we can make it better.
  • 00:08:51
    So this is-- we're going to start
  • 00:08:53
    with a very simple problem, which
  • 00:08:54
    is computing Fibonacci numbers.
  • 00:08:59
    It's really just a toy problem to illustrate a very powerful
  • 00:09:02
    idea, which is memoization.
  • 00:09:06
    So the problem I'm interested in is I'm
  • 00:09:08
    given a particular number, n.
  • 00:09:10
    And I want to compute the nth Fibonacci number.
  • 00:09:14
    And in case you forgot, the nth Fibonacci number
  • 00:09:17
    is given by this recurrence. fn is fn minus 1 plus fn minus 2
  • 00:09:22
    with base case, let's say, f1 equals f2 equals 1.
  • 00:09:26
  • 00:09:29
    And so we'd like to compute this.
  • 00:09:30
    This seems-- this is a recurrence.
  • 00:09:33
    So it seems very natural to write it
  • 00:09:36
    as a recursive algorithm.
  • 00:09:37
    So let's try to do it.
  • 00:09:38
    We start with what are the sub problems.
  • 00:09:41
    The obvious sub problems are just
  • 00:09:45
    the various Fibonacci numbers, f i for i between 1 and n.
  • 00:09:55
    So there are n of these sub problems.
  • 00:09:57
  • 00:10:00
    Cool.
  • 00:10:01
    Let's see.
  • 00:10:02
    We want a relation between them.
  • 00:10:04
  • 00:10:07
    Well, maybe just to distinguish the problems from the Fibonacci
  • 00:10:10
    numbers, let me write f of i.
  • 00:10:13
    This is a function, an algorithm we're going to define.
  • 00:10:16
    And it's defined to be--
  • 00:10:18
    the goal we're trying to get is the ith Fibonacci number
  • 00:10:22
    given i.
  • 00:10:23
    And then we can write the recurrence relation
  • 00:10:26
    on these guys, just f of i equals f of i minus 1
  • 00:10:31
    plus f of i minus 2.
  • 00:10:34
    So in other words, recursively compute those Fibonacci numbers
  • 00:10:37
    then add them together.
  • 00:10:39
    That's an algorithm.
  • 00:10:41
    Next is t for topological order.
  • 00:10:44
  • 00:10:48
    Here, of course, we just want to compute
  • 00:10:51
    these in order of increasing i, from the base cases up.
  • 00:10:57
    Another way I like to write this is as a for loop for i
  • 00:11:01
    equals 1 to n.
  • 00:11:04
    We will see why.
  • 00:11:06
    But this gives an explicit order to compute these sub problems.
  • 00:11:10
  • 00:11:13
    And base case is just the same as the Fibonacci numbers,
  • 00:11:21
    but I guess I should write in parentheses.
  • 00:11:23
  • 00:11:25
    The original problem we want to solve is f of n.
  • 00:11:30
    And the time-- all right, here's where
  • 00:11:32
    things get interesting or bad.
  • 00:11:35
    So what is the running time of this recursive algorithm?
  • 00:11:39
    As I've stated it so far, the running time
  • 00:11:42
    is given by a recurrence.
  • 00:11:46
    Let's write the recurrence.
  • 00:11:47
    So in order to compute f of n, I recursively
  • 00:11:52
    compute f of i minus 1 or f of n minus 1 here.
  • 00:11:58
    And I recursively compute f of n minus 2.
  • 00:12:03
    So that will take t of n minus 2.
  • 00:12:05
    This first step will take t of n minus 1.
  • 00:12:07
    And now I need to solve this recurrence.
  • 00:12:11
    This is not a recurrence that falls to the master method.
  • 00:12:14
    It doesn't have a divided by.
  • 00:12:17
    So we have to think about it a little bit.
  • 00:12:19
    But we don't have to think about it
  • 00:12:20
    too hard, because this recurrence is
  • 00:12:23
    the same as this recurrence, which
  • 00:12:25
    is the same as this recurrence.
  • 00:12:26
    I've written it three times now.
  • 00:12:27
    And so the solution to this is the nth Fibonacci number.
  • 00:12:32
    Oh, sorry.
  • 00:12:33
    It's a little bit worse because in addition
  • 00:12:35
    to those recursions, I also spend constant time
  • 00:12:38
    to do the addition, maybe more than constant time.
  • 00:12:40
    But if we just count the number of additions we do,
  • 00:12:44
    it will be plus 1 additions.
  • 00:12:51
  • 00:12:53
    OK.
  • 00:12:54
    But this is bigger than the nth Fibonacci number.
  • 00:12:58
    And if you know anything about Fibonacci numbers,
  • 00:13:01
    they grow exponentially.
  • 00:13:03
    They're about golden ratio to the n.
  • 00:13:06
    I'm wearing golden ratio, in case you forgot the number.
  • 00:13:09
    So that's bad, because golden ratio is bigger than 1.
  • 00:13:13
    So this is exponential growth, as we know, especially
  • 00:13:16
    in this time, exponential growth is bad.
  • 00:13:18
    In algorithms, exponential growth
  • 00:13:19
    is bad, because we can only solve very small problems
  • 00:13:22
    with exponential growth.
  • 00:13:23
    Very small n.
  • 00:13:25
    So this is a terrible way to compute the nth Fibonacci
  • 00:13:28
    number--
  • 00:13:29
    exponential bad.
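
Spelled out, the recurrence and growth bound just described are:

    T(n) = T(n-1) + T(n-2) + 1  ≥  F_n  ≈  φ^n,   where φ = (1 + √5)/2 ≈ 1.618
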
  • 00:13:40
    OK, so don't do this.
  • 00:13:43
    But there's a very tiny tweak to this algorithm
  • 00:13:47
    that makes it really good, which is memoization.
  • 00:13:53
    And this is a big idea.
  • 00:13:55
    It is the big idea of dynamic programming.
  • 00:13:59
  • 00:14:02
    It's a funny word, probably made up by computer scientists.
  • 00:14:07
    Instead of memorization, it's memoization,
  • 00:14:11
    because we're going to write things down in a memo pad.
  • 00:14:14
    It's the idea.
  • 00:14:16
    And it's a very simple idea, which
  • 00:14:18
    is just remember and reuse solutions to sub-problems.
  • 00:14:24
  • 00:14:35
    So let's draw the recursion tree for this recursive algorithm
  • 00:14:42
    as we've done it so far.
  • 00:14:43
    So at the top, we-- let me make a little bit of space.
  • 00:14:48
    At the top we are calling f of n.
  • 00:14:53
    And then that calls f of n minus 1 and f of n minus 2.
  • 00:14:59
    And it does an addition up here.
  • 00:15:00
    And then this calls f of n minus 2.
  • 00:15:04
    And this calls f of n minus 3.
  • 00:15:08
    This calls f of n minus 3.
  • 00:15:11
    And this calls f of n minus 4.
  • 00:15:15
    OK.
  • 00:15:16
    And we notice that this sub problem is
  • 00:15:23
    the same as this sub problem.
  • 00:15:25
    So to compute f of n minus 1, I need f of n minus 3.
  • 00:15:28
    And also to compute f of n minus 2 I need f of n minus 3.
  • 00:15:32
    So why are we computing it twice?
  • 00:15:33
    Let's just do it once.
  • 00:15:36
    When we solve it, let's write it in a table somewhere.
  • 00:15:39
    And then when we need it again, we'll just reuse that value.
  • 00:15:42
    Question?
  • 00:15:42
    AUDIENCE: What about the f of n minus 2?
  • 00:15:44
    ERIK DEMAINE: f of n minus 2 is also shared.
  • 00:15:46
    So let me use a different symbol.
  • 00:15:49
    f of n minus 2 is already here.
  • 00:15:53
    So this was at the same level.
  • 00:15:54
    But we also get shared reuse between different levels.
  • 00:15:57
    In fact, I wouldn't even call f of n minus 3
  • 00:15:59
    because this whole part doesn't need
  • 00:16:02
    to be computed a second time.
  • 00:16:03
    If I already computed it here, it
  • 00:16:05
    doesn't matter which one comes first.
  • 00:16:07
    Let's say this one comes first.
  • 00:16:08
    Once this is done, I can write it down and reuse it over here.
  • 00:16:11
  • 00:16:14
    And then in here, we're going to call f of n minus three.
  • 00:16:18
    So there's still another computation of f of n minus 3.
  • 00:16:21
    When that one's done, I won't need to do this recursively.
  • 00:16:25
    OK, so magically this is going to make
  • 00:16:28
    this algorithm efficient with this very simple tweak.
  • 00:16:32
    Let me write down the tweak more explicitly.
  • 00:16:35
    I won't write code here.
  • 00:16:36
    But just describe it as a data structure.
  • 00:16:41
    So we're going to maintain our good friend, the dictionary,
  • 00:16:47
    which is abstract data type or interface.
  • 00:16:52
    We could use different data structures to do it.
  • 00:16:54
    But we're going to map some problems
  • 00:16:57
    to their solutions, at least the ones that we've solved already.
  • 00:17:01
  • 00:17:04
    And usually we can do this with just a direct access
  • 00:17:07
    array, though you could use a hash table.
  • 00:17:09
    You just get expected bounds.
  • 00:17:12
    So when we write the code for our recursive function--
  • 00:17:23
    so in general, once we have a SRTBOT description,
  • 00:17:26
    we can turn this into code.
  • 00:17:28
    We define f of i.
  • 00:17:30
    And it says am I in a base case?
  • 00:17:32
    If so, return this.
  • 00:17:34
    Otherwise, do this recursive call.
  • 00:17:36
    That's our recursive algorithm.
  • 00:17:38
    But we're going to do a little more now.
  • 00:17:39
    And first we're going to check whether this sub
  • 00:17:44
    problem that we're trying to solve has already been solved.
  • 00:17:49
    And if so, we return that stored solution.
  • 00:17:55
    That's the easy case, but it might not exist.
  • 00:17:59
  • 00:18:10
    And then we'll compute it in the usual way.
  • 00:18:14
    So what the code then would look like to define f of i
  • 00:18:19
    is first we check is i in our data structure.
  • 00:18:22
    This is usually called the memo.
  • 00:18:25
  • 00:18:28
    So we say, is this sub-problem-- is i in my memo data structure?
  • 00:18:33
    If so just return memo of i.
  • 00:18:34
    Done.
  • 00:18:35
    No recursion necessary.
  • 00:18:36
    Otherwise, check if I'm a base case.
  • 00:18:39
    If so, done.
  • 00:18:40
    Otherwise, recurse.
  • 00:18:42
    So recursively call f of i minus 1 and f of i minus 2.
  • 00:18:46
    And in this recursion, we can see
  • 00:18:48
    that after we call f of i minus 1, in fact,
  • 00:18:50
    it will have already computed f of i minus 2.
  • 00:18:52
    So while this call is recursive, this one
  • 00:18:55
    will immediately terminate because i minus 2
  • 00:18:57
    will already be in the memo table.
  • 00:18:59
    And so if you think about what happens, in fact,
  • 00:19:03
    we'll just have recursion down the left branch of this thing.
  • 00:19:07
    And all the right branches will be free.
  • 00:19:09
    We can just look things up in the memo table.
  • 00:19:12
    So what is the overall running time?
  • 00:19:14
    For Fibonacci, this should be order n.
  • 00:19:25
    Why is it order n?
  • 00:19:27
    This is the number of additions.
  • 00:19:28
  • 00:19:31
    Come back to that in a second.
  • 00:19:33
  • 00:19:36
    In general, the way to analyze an algorithm
  • 00:19:39
    like this that uses memoization is we just
  • 00:19:41
    count how many different sub-problems are there?
  • 00:19:43
    Because once we solve the sub-problem,
  • 00:19:45
    we will never solve it again.
  • 00:19:46
    That's the whole idea of a memo table.
  • 00:19:48
    So we will solve each sub-problem at most once.
  • 00:19:51
    And so we just need to count, how much time does it take
  • 00:19:54
    to solve every sub-problem?
  • 00:19:55
    And here you can see it's constant.
  • 00:19:59
    Either it's a base case and it takes constant time
  • 00:20:01
    or we recursively call these things.
  • 00:20:04
    But those are different sub-problems.
  • 00:20:06
    So we're going to count those later.
  • 00:20:08
    And then the work that's actually
  • 00:20:09
    done by this recurrence is a single addition.
  • 00:20:12
    So in fact, it's n additions.
  • 00:20:14
    To compute fn would be exactly n additions.
  • 00:20:20
    So it turns out to be a very nice closed form in this case.
  • 00:20:24
    It should be exactly n sub problems to compute f of n
  • 00:20:29
    because we start at 1.
  • 00:20:32
    And each one has one addition--
  • 00:20:35
    I guess not the base case.
  • 00:20:37
    Maybe n minus 2.
  • 00:20:39
    OK.
  • 00:20:40
    Definitely order n.
  • 00:20:43
    Now, there's this one subtlety which--
  • 00:20:46
    let's forget about dynamic programming for a moment
  • 00:20:48
    and go back to good old lecture one and two,
  • 00:20:51
    talking about the word ram model of computation.
  • 00:20:55
    A question here that usually doesn't matter in this class.
  • 00:20:58
    Usually we assume additions take constant time.
  • 00:21:01
    And we usually do that because it's usually true.
  • 00:21:04
    And in general, our model is the w bit additions--
  • 00:21:09
    where w is our machine word size--
  • 00:21:12
    takes constant time.
  • 00:21:13
  • 00:21:19
    But for this problem and this problem only,
  • 00:21:22
    pretty much, for Fibonacci numbers,
  • 00:21:24
    I happen to know that the Fibonacci
  • 00:21:25
    numbers grow exponentially.
  • 00:21:27
    So to write them down actually requires theta n bits
  • 00:21:32
    because they are some constant to the n power.
  • 00:21:35
    And so they're actually really big.
  • 00:21:38
    n is probably bigger than w.
  • 00:21:40
    Usually you think of problems that are much bigger than 64
  • 00:21:44
    or whatever your word size happens to be.
  • 00:21:46
    We do assume that w is at least log n.
  • 00:21:48
    But n is probably bigger than w.
  • 00:21:51
    It might be bigger or smaller.
  • 00:21:52
    We don't know.
  • 00:21:53
    And in general, to do an n bit addition--
  • 00:21:57
    these are n bit additions--
  • 00:22:01
    is going to take ceiling of n over w time.
  • 00:22:07
    So in the end, we will spend this times n,
  • 00:22:11
    because we have to do that, many of them,
  • 00:22:12
    which is n plus n squared over w time.
  • 00:22:18
    So a bit of a weird running time.
  • 00:22:19
    But it's polynomial, whereas this original recursive
  • 00:22:23
    algorithm was exponential here.
  • 00:22:26
    Using this one simple idea of just remembering
  • 00:22:28
    the work we've done, suddenly this exponential time algorithm
  • 00:22:31
    becomes polynomial.
  • 00:22:32
    Why?
  • 00:22:33
    Because we have few sub problems.
  • 00:22:35
    We had n sub problems.
  • 00:22:39
    And for each sub problem, we could write a recurrence
  • 00:22:42
    relation that if we already knew the solutions to smaller sub
  • 00:22:46
    problems, we could compute this bigger
  • 00:22:48
    problem very efficiently.
  • 00:22:50
    This happened to be constant time or constant additions.
  • 00:22:56
    n over w time.
  • 00:22:57
    But as long as this is polynomial
  • 00:22:59
    and this is polynomial, we're happy,
  • 00:23:02
    because we have this nice formula that the time it takes
  • 00:23:07
    is, at most, the sum over all sub problems of the relation
  • 00:23:15
    time.
  • 00:23:15
  • 00:23:18
    So I'm referring to sub problems, like a number of them
  • 00:23:23
    and the time it takes to evaluate this,
  • 00:23:26
    ignoring the recursive calls.
  • 00:23:28
    That's important.
  • 00:23:28
    This is the non recursive part.
  • 00:23:33
  • 00:23:38
    In the notes, I call this non-recursive work.
  • 00:23:40
  • 00:23:45
    So this formula gives us a way to bound
  • 00:23:49
    the running time of one of these algorithms
  • 00:23:52
    if we use memoization.
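
In symbols, the bound being described is:

    total time ≤ Σ over subproblems p of (non-recursive work for p)
               ≤ (# subproblems) × (max non-recursive work per subproblem)
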
  • 00:23:54
    Without memoization, this is not true,
  • 00:23:55
    Fibonacci took exponential time.
  • 00:23:57
    But if we add memoization, we know that we only
  • 00:23:59
    solve each sub-problem once.
  • 00:24:01
    And so we just need to see, for each one,
  • 00:24:03
    how much did it cost me to compute it,
  • 00:24:05
    assuming all the recursion work is free,
  • 00:24:07
    because that's already taken into account by the summation.
  • 00:24:11
    So in particular, this summation is at most
  • 00:24:12
    the number of sub-problems times the time per sub-problem,
  • 00:24:15
    which in this case was order n.
  • 00:24:17
    We could try to apply that analysis to merge sort,
  • 00:24:20
    because after all, this is also a recursive algorithm.
  • 00:24:23
    It happens to not benefit from memoization.
  • 00:24:25
    But we could throw in memoization.
  • 00:24:27
    It wouldn't hurt us.
  • 00:24:29
    But if you think about the call graph here,
  • 00:24:31
    which is like s of (0, n), which calls s of (0, n/2)
  • 00:24:38
    and s of (n/2, n) and so on.
  • 00:24:45
    It has the same picture, but there's actually
  • 00:24:47
    no common substructure here.
  • 00:24:49
    You'll never see a repeated sub-problem,
  • 00:24:51
    because this range is completely disjoint from this range.
  • 00:24:55
    But you could throw in memoization
  • 00:24:56
    and try to analyze in the same way
  • 00:24:58
    and say, well, how many sub-problems are there?
  • 00:25:01
    It looks like there's n choices for i and not quite n choices
  • 00:25:07
    but it's at most n squared different choices.
  • 00:25:11
    In fact, it's the triangular number sum of i equals 1 to n
  • 00:25:16
    of i different possible choices for i and j.
  • 00:25:20
    But this is theta n squared sub-problems,
  • 00:25:24
    which seems not so good.
  • 00:25:27
    And then how much time are we spending per sub problem?
  • 00:25:29
    Well, to solve s of ij, we have to merge
  • 00:25:33
    about that many elements.
  • 00:25:35
    We know merge takes linear time.
  • 00:25:36
    And so this takes theta j minus i time to evaluate.
  • 00:25:43
    And so what we'd like to do is sum over all the sub
  • 00:25:46
    problems of j minus i.
  • 00:25:48
    This is not the triangular number
  • 00:25:51
    but the tetrahedral number, I guess.
  • 00:25:53
    And so we end up that the running time
  • 00:25:56
    is, at most, n cubed.
  • 00:25:58
  • 00:26:00
    Great.
  • 00:26:02
    So it's true that n log n is less than or equal to n cubed,
  • 00:26:05
    but obviously not terribly useful.
  • 00:26:07
    This algorithm by the way we already know how to analyze it
  • 00:26:11
    is, indeed, n log n.
  • 00:26:12
    And the running time turns out to be theta n log n.
  • 00:26:17
    So sometimes this equation is not what you want to use.
  • 00:26:20
    But often it's good enough.
  • 00:26:22
    And especially if you just want to get
  • 00:26:23
    a polynomial upper bound, then you
  • 00:26:25
    can try to optimize it later.
  • 00:26:27
    This will give you a polynomial upper
  • 00:26:28
    bound as long as the number of sub-problems is polynomial
  • 00:26:31
    and the time per sub-problem is polynomial.
  • 00:26:34
    And indeed, n cubed is polynomial.
  • 00:26:35
    It's not a great polynomial, but this is an alternate way
  • 00:26:39
    to analyze merge sort.
  • 00:26:40
    Obviously don't do this for merge sort.
  • 00:26:43
    But it illustrates the technique.
  • 00:26:47
    Good so far?
  • 00:26:49
    Any questions?
  • 00:26:51
    All right.
  • 00:26:52
    Let me remember where we are.
  • 00:26:57
    Cool.
  • 00:26:58
    So the next thing I'd like to do is show you one more algorithm
  • 00:27:02
    that we've already seen in this class that fits very
  • 00:27:04
    nicely into this structure--
  • 00:27:07
    arguably is a dynamic program--
  • 00:27:09
    and that is DAG shortest paths.
  • 00:27:13
    So just to close the loop here, when I say dynamic programming,
  • 00:27:17
    I mean recursion with memoization.
  • 00:27:21
    I mean, we take--
  • 00:27:24
    we write a recursive piece of code,
  • 00:27:28
    which is like def f of some args,
  • 00:27:32
    some sub-problem specification.
  • 00:27:38
    We check is the problem in the memo table?
  • 00:27:44
    If so, return memo of sub-problem.
  • 00:27:52
  • 00:27:56
    And otherwise check if it's a base case
  • 00:28:01
    and solve it if it's a base case.
  • 00:28:03
    And otherwise, write the recurrence recurse
  • 00:28:09
    via relation.
  • 00:28:11
  • 00:28:14
    And set the memo table of the sub-problem
  • 00:28:20
    to be one of those things.
  • 00:28:23
    OK, so this is the generic dynamic program.
  • 00:28:26
    And implicitly, I'm writing Fibonacci in that way.
  • 00:28:32
    And all of the dynamic programs have this implicit structure
  • 00:28:35
    where I start with a memo table which is empty
  • 00:28:41
    and I always just check if I'm in the memo table.
  • 00:28:44
    If I am, I return it.
  • 00:28:45
    Otherwise I compute according to this recursive relation
  • 00:28:50
    by recursively calling f.
  • 00:28:54
    And that's it.
  • 00:28:55
    So this is every DP algorithm is going to have that structure.
  • 00:29:00
    And it's just using recursion and memoization together.
  • 00:29:04
    OK, so now let's apply that technique
  • 00:29:06
    to think about the DAG shortest paths problem.
  • 00:29:10
    The problem was, I give you a DAG.
  • 00:29:12
    I give you a source vertex, S--
  • 00:29:15
    single source shortest paths.
  • 00:29:16
    Compute the shortest path weight from S to every vertex.
  • 00:29:20
    That's the goal of the problem.
  • 00:29:22
    And we saw a way to solve that, which is DAG relaxation.
  • 00:29:24
    I'm going to show you a different way, which turns out
  • 00:29:27
    to be basically the same, but upside down, or flipped left
  • 00:29:31
    right, depending which way you direct your edges.
  • 00:29:35
    So what are our sub-problems?
  • 00:29:37
    Well, here, actually, they're kind of spelled out for us.
  • 00:29:40
    We want to compute delta of (s, v) for all vertices v.
  • 00:29:42
    So that is the set of sub-problems.
  • 00:29:47
    That turns out to be enough for this overall problem.
  • 00:29:51
    And the original problem we want to solve
  • 00:29:53
    is all of the sub-problems.
  • 00:29:55
    We solve all the sub-problems, we're done.
  • 00:29:57
    And then we have--
  • 00:29:59
    I think we wrote this at some point during the DAG
  • 00:30:02
    shortest paths lecture--
  • 00:30:03
    we have a recursive relation saying
  • 00:30:05
    that the shortest way to get from s to v
  • 00:30:09
    is the minimum of the shortest path
  • 00:30:12
    to get to some vertex u plus the weight of the edge from u to v.
  • 00:30:16
    Why?
  • 00:30:16
    Because if we look at a vertex v, unless we started there,
  • 00:30:21
    we came from somewhere.
  • 00:30:23
    And so we can consider all of the possible choices
  • 00:30:27
    for the previous vertex u.
  • 00:30:30
    And if you start at s and get to v,
  • 00:30:32
    you must go through one of them.
  • 00:30:34
    And so this is finding the best way among all the choices of u.
  • 00:30:39
    What's the best way to get to u?
  • 00:30:40
    And then take the edge from u to v for all edges uv.
  • 00:30:44
    And this is adjacency minus.
  • 00:30:46
    We don't usually think of that.
  • 00:30:48
    Usually we look at adjacency plus the outgoing edges.
  • 00:30:51
    This is the incoming edges.
  • 00:30:52
    And so u is an incoming--
  • 00:30:55
    uv is an incoming edge into v. OK, if we take that minimum--
  • 00:30:59
    and of course, possible there is no way to get to v.
  • 00:31:03
    And so I'll also throw infinity into the set.
  • 00:31:06
    Take the min of that set.
  • 00:31:07
    That will give me the shortest pathway
  • 00:31:09
    in an acyclic graph from s to v. And great, this is recursive.
  • 00:31:13
    This was a sub problem.
  • 00:31:14
    These are sub problems which are smaller, I guess.
  • 00:31:19
    There's no clear notion of smaller
  • 00:31:21
    here, except we already know the clear notion of smaller
  • 00:31:25
    is the topological order of our DAG.
  • 00:31:30
    Because our graph is acyclic, we know
  • 00:31:32
    it has a topological order.
  • 00:31:33
    We know how to compute it with DFS.
  • 00:31:36
    And so that guarantees there's a topological order
  • 00:31:39
    to compute these problems.
  • 00:31:41
    And in fact, the relationship between problems
  • 00:31:46
    is exactly the given graph, G. In order
  • 00:31:50
    to compute the shortest pathway from s to v,
  • 00:31:54
    I need to know the shortest pathway from s
  • 00:31:56
    to all of the incoming vertices to v.
  • 00:31:58
    And so this is I guess in the call graph,
  • 00:32:02
    this vertex calls this vertex, but direct the edge this way
  • 00:32:07
    to say that this vertex requires--
  • 00:32:11
    this vertex needs to be computed before this one.
  • 00:32:14
    And so then I can complete them in a topological order.
  • 00:32:18
    OK, we have a base case, which is delta of (s, s) equals 0.
  • 00:32:22
    And the running time is, again, we can use this formula
  • 00:32:26
    and say, let's just sum over all the sub problems of the non
  • 00:32:29
    recursive work in our recurrence relation
  • 00:32:32
    and so it's computing this min.
  • 00:32:34
    If I gave you these deltas for free
  • 00:32:38
    and I gave you these weights, which we know from our weight
  • 00:32:41
    data structure, how long does it take to compute this min?
  • 00:32:44
    Well, however many things there are, however
  • 00:32:46
    many numbers we're minning, which
  • 00:32:48
    is the size of the incoming adjacency list plus 1
  • 00:32:52
    for that infinity.
  • 00:32:54
    And so if you compute this sum, sum of incoming edges
  • 00:32:57
    to every vertex, that's all the edges.
  • 00:33:00
    So this is v plus e.
  • 00:33:03
    So in fact, this algorithm is morally the same algorithm
  • 00:33:09
    as the one that we saw on the DAG shortest path
  • 00:33:11
    lecture, which was compute a topological order and process
  • 00:33:18
    vertices in that order and relax edges going out from vertices.
  • 00:33:23
    So here-- so in that algorithm, we
  • 00:33:26
    would have tried to relax this edge if there was a better
  • 00:33:30
    path to v. And the first one certainly
  • 00:33:32
    is better than infinity.
  • 00:33:33
    So the first one we relax indeed.
  • 00:33:36
    The next edge, if this gave a better path from s to v,
  • 00:33:39
    then we would relax that edge and update the way here
  • 00:33:42
    and do the same here.
  • 00:33:43
    In the end, we're just computing this min in the relaxation
  • 00:33:46
    algorithm but doing it step by step.
  • 00:33:48
    In the relaxation algorithm, DAG relaxation,
  • 00:33:51
    for each incoming edge to v, we update d of v if it's better.
  • 00:33:58
    And so if you repeatedly update if you're better,
  • 00:34:02
    that ends up computing a min.
  • 00:34:04
    OK, so this is the same algorithm
  • 00:34:06
    just kind of flipped backwards.
  • 00:34:08
    A funny thing, although we wrote down
  • 00:34:11
    the topological order of the sub problem graph
  • 00:34:15
    here is the topological order of g,
  • 00:34:16
    because the sub-problem graph is g,
  • 00:34:22
    the algorithm doesn't actually have to compute one.
  • 00:34:24
    It's doing it automatically for free.
  • 00:34:27
    If you think about this algorithm,
  • 00:34:31
    generic dp algorithm, which is check
  • 00:34:34
    whether we're in a memo table.
  • 00:34:35
    If so, return.
  • 00:34:36
    Otherwise, recurse, or base case.
  • 00:34:40
    This actually is a depth-first search
  • 00:34:43
    through the sub-problem graph-- technically through the reverse
  • 00:34:46
    of the sub-problem graph.
  • 00:34:48
    If I draw an edge--
  • 00:34:51
    so from small to big--
  • 00:34:58
    so I'm just saying, I orient the edges from my smaller
  • 00:35:01
    sub-problems to the ones that need it--
  • 00:35:04
    then I'm actually depth-first searching backwards
  • 00:35:06
    in this graph because the bigger problem calls the smaller
  • 00:35:10
    problem.
  • 00:35:12
    And the memo table is serving as the "have I visited this vertex
  • 00:35:16
    already" check in DFS.
  • 00:35:18
    So this is actually a DFS algorithm.
  • 00:35:20
    Plus we're doing some computation to actually solve
  • 00:35:24
    the sub-problems we care about.
  • 00:35:27
    So implicit in this algorithm, we are doing a DFS,
  • 00:35:29
    and at the same time, we're doing this shortest path
  • 00:35:32
    computation in the finishing order of that DFS traversal
  • 00:35:37
    because all the edges are backwards.
  • 00:35:39
    This is the same as the reverse finishing order
  • 00:35:41
    if the graph is forwards.
  • 00:35:42
    So in the end, we're computing a topological order
  • 00:35:46
    because dynamic programming includes
  • 00:35:49
    in it depth first search.
  • 00:35:52
    A lot of words.
  • 00:35:54
    But it's kind of cool that this framework just
  • 00:35:57
    solves DAG shortest paths without much work.
  • 00:36:02
    I mean, we did a lot of work in shortest paths
  • 00:36:04
    to prove that this relation is true.
  • 00:36:05
    Once you know it's true, the algorithm part
  • 00:36:08
    is pretty much free.
  • 00:36:11
    You just write down SRTBOT and you're done.
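
A compact Python sketch (ours) of this DAG shortest-paths dynamic program, assuming the input is given as adj_in, a dict mapping each vertex v to a list of its incoming edges as (u, w) pairs with w the weight of edge (u, v):

    import math

    def dag_shortest_paths(adj_in, s):
        # delta(v) = shortest-path weight from s to v in the DAG.
        memo = {}
        def delta(v):
            if v in memo:                  # memoization: each vertex solved once
                return memo[v]
            if v == s:                     # base case: delta(s, s) = 0
                memo[v] = 0
                return 0
            # relation: min over incoming edges, with infinity thrown in
            # for the case that v is unreachable from s
            memo[v] = min([delta(u) + w for (u, w) in adj_in[v]] + [math.inf])
            return memo[v]
        return {v: delta(v) for v in adj_in}

For example, with adj_in = {'s': [], 'a': [('s', 2)], 'b': [('s', 5), ('a', 1)]}, this returns delta('b') = 3, and the memo table makes the recursion a depth-first search of the subproblem graph, as described above.
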
  • 00:36:15
    OK.
  • 00:36:18
    This brings us to in general--
  • 00:36:24
    at this point we have seen two examples
  • 00:36:26
    of dynamic programming.
  • 00:36:27
    I guess technically merge sort you
  • 00:36:28
    could think of as a dynamic program,
  • 00:36:30
    but it doesn't actually reuse anything.
  • 00:36:31
    So it's not interesting.
  • 00:36:32
    And indeed, that gave us a really bad bound.
  • 00:36:34
    We've definitely seen DAG shortest paths and Fibonacci
  • 00:36:37
    numbers as two interesting examples.
  • 00:36:39
    And what the remainder of this lecture and the next three
  • 00:36:43
    lectures are going to be about is more and more examples
  • 00:36:46
    of dynamic programming and how you
  • 00:36:48
    can use it to solve increasingly general problems.
  • 00:36:51
    So far, we've just solved an easy problem and a problem
  • 00:36:53
    we already knew how to solve.
  • 00:36:55
    Let's go to a new problem, which is bowling.
  • 00:37:02
    Bowling is popular in Boston.
  • 00:37:04
  • 00:37:06
    Boston likes to play candlepin bowling, which
  • 00:37:10
    is a bit unusual.
  • 00:37:12
    Today we're going to play an even more unusual bowling
  • 00:37:15
    game, one that I made up based on a bowling game
  • 00:37:19
    that Henry [INAUDIBLE] made up in 1908.
  • 00:37:22
    So ancient bowling, I'll call it,
  • 00:37:26
    or I think linear bowling is what I might call it.
  • 00:37:29
    I'll just call it bowling here.
  • 00:37:31
    And now I'm going to attempt to draw a bowling pin.
  • 00:37:34
    Not bad.
  • 00:37:36
    They might get progressively worse.
  • 00:37:39
    So imagine n identical bowling pins.
  • 00:37:42
    Please pretend these are identical.
  • 00:37:44
    And I have a ball which is approximately the same size
  • 00:37:49
    as a bowling pin.
  • 00:37:50
    These bowling pins are pretty close together.
  • 00:37:52
    I should have left a little gap here.
  • 00:37:54
    And you are a really good bowler.
  • 00:37:57
    Now, unfortunately, these bowling pins are on a line.
  • 00:38:00
    And you're bowling from way down at infinity.
  • 00:38:03
    So when you bowl, you can only hit one pin
  • 00:38:08
    or two pins or zero pins.
  • 00:38:10
    But probably you want to hit some pins.
  • 00:38:12
    So if you bowl straight at a pin,
  • 00:38:14
    you will just hit that one pin.
  • 00:38:16
    And if you bowl in the middle between two pins,
  • 00:38:20
    you will knock down--
  • 00:38:22
    that's a ball, sorry--
  • 00:38:23
    you will knock down two pins.
  • 00:38:26
    And this is your model of bowling, model of computation.
  • 00:38:30
    Now, what makes this interesting is that the pins have values.
  • 00:38:36
    Pin i has value--
  • 00:38:39
    this is obviously a toy problem, though this problem--
  • 00:38:44
    this type of bowling does go back to 1908,
  • 00:38:47
    it was also a toy problem in that setting.
  • 00:38:50
    So each of these bowling pins has some number on it,
  • 00:38:55
    let's say 1, 9, 9--
  • 00:38:57
  • 00:39:01
    I'll do a slightly more interesting example,
  • 00:39:03
    maybe another one here and a 2 and a 5 and a 5, something
  • 00:39:11
    like this.
  • 00:39:13
    OK.
  • 00:39:15
    Or maybe make it a little more interesting.
  • 00:39:17
    Let's put some negative numbers on here.
  • 00:39:19
    OK.
  • 00:39:19
    And the model-- so you're at the carnival bowling.
  • 00:39:24
    Each pin has different-- potentially different values.
  • 00:39:27
    And the model is if you hit one pin, i, then you get vi points.
  • 00:39:38
    So that's straight forward.
  • 00:39:40
    To make it interesting, when you hit two pins,
  • 00:39:43
    you get the product.
  • 00:39:45
    So if I hit two pins, it's always
  • 00:39:46
    i and i plus 1 for some i. You get vi times vi plus 1 points.
  • 00:39:53
  • 00:39:56
    This is the game you're playing.
  • 00:39:58
    And it doesn't really matter that this is a product.
  • 00:40:00
    Product is just some weird function
  • 00:40:03
    that's hard to imagine.
  • 00:40:05
    If you stare at this long enough,
  • 00:40:07
    you should convince yourself that the optimal solution
  • 00:40:10
    is probably to--
  • 00:40:11
    so, for each of these numbers, I could
  • 00:40:13
    leave it as a singleton, or pair it with its left neighbor,
  • 00:40:15
    or pair it with its right neighbor.
  • 00:40:17
    But the pairings can't overlap because once I hit a pin,
  • 00:40:19
    it's gone.
  • 00:40:20
    It's knocked over.
  • 00:40:21
    It disappears.
  • 00:40:22
    So because of these nines, which have a very high value, what
  • 00:40:26
    I'd probably like to do is hit both of them together,
  • 00:40:28
    so pair them up, because 9 times 9 is 81.
  • 00:40:32
    That's really big, much better than hitting them individually
  • 00:40:34
    or hitting 9 times 1 or 9 times 2.
  • 00:40:37
    1 and 1 is kind of funny, because it's actually better
  • 00:40:40
    to hit them individually.
  • 00:40:41
    That will give you two points, whereas if I paired them up,
  • 00:40:43
    I'd only get one point.
  • 00:40:46
    2 and minus 5, that seems bad.
  • 00:40:47
    Negative 10 points.
  • 00:40:48
    My goal is to maximize score.
  • 00:40:50
  • 00:40:56
    Do you have to hit all the pins?
  • 00:41:00
    Let's say no, you don't have to hit all the pins.
  • 00:41:02
    So I could skip the minus fives.
  • 00:41:04
    But in fact, here, because they're adjacent,
  • 00:41:07
    minus 5 times minus 5 is good.
  • 00:41:09
    That's 25 points.
  • 00:41:11
    So the optimal solution for this particular instance
  • 00:41:14
    is to hit all the pins: these positive ones,
  • 00:41:18
    these together, these together.
  • 00:41:19
    If I added, for example, another pin of minus 3 here,
  • 00:41:23
    I would choose not to hit that pin.
  • 00:41:25
    Good question.
  • 00:41:26
    So you just play until you are tired.
  • 00:41:29
    When you decide to stop playing, how do you maximize your score?
  • 00:41:32
    There are many variations of this game.
  • 00:41:34
    All of them-- basically any variation--
  • 00:41:37
    not literally every variation, but many,
  • 00:41:40
    many variations of this problem can all be solved quickly
  • 00:41:43
    with dynamic programming.
  • 00:41:44
    But let's solve this particular one.
  • 00:41:47
    OK.
  • 00:41:48
  • 00:41:54
    So now we're really in algorithmic design mode.
  • 00:41:58
    We need to think about SRTBOT.
  • 00:42:01
    And in particular, we need to think about what would the sub
  • 00:42:04
    problems be here?
  • 00:42:06
    And at this point, we don't have a lot of help.
  • 00:42:08
    So I should probably give you some tools.
  • 00:42:11
    If I want to solve a problem like this,
  • 00:42:13
    the input is a sequence of numbers.
  • 00:42:18
    It's a sequence data structure.
  • 00:42:19
    Maybe it's an array of numbers, which is this v array.
  • 00:42:23
  • 00:42:26
    And let's see.
  • 00:42:28
  • 00:42:31
    Here's a general tool for sub-problem design
  • 00:42:39
    which will cover most of the problems-- maybe all
  • 00:42:42
    of the problems that we see in this class
  • 00:42:45
    for dynamic programming.
  • 00:42:47
    Here's a trick.
  • 00:42:49
    If your input is a sequence, here are some good sub-problems
  • 00:43:03
    to consider.
  • 00:43:04
  • 00:43:13
    We could do all prefixes.
  • 00:43:15
  • 00:43:18
    So let's call the sequence x.
  • 00:43:22
    So we could do prefixes of x, meaning x up to a given i, for all i.
  • 00:43:27
    We could do all the suffixes, x from i onward for all i.
  • 00:43:34
    Or we could do substrings, which are the consecutive items
  • 00:43:41
    from i to j.
  • 00:43:43
    I don't write subsequence here.
  • 00:43:44
    Subsequence means you can omit items in the middle.
  • 00:43:46
    So for a substring, you have to start at some position i
  • 00:43:49
    and take all the items up to j.
  • 00:43:51
    So these are nice, easy to express in Python notation.
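For instance, written as Python slices (a quick sketch, with arbitrary sample values rather than the board example):

```python
# The three standard subproblem families for a sequence x,
# expressed as Python slices.
x = [1, 9, 9, 2, 5, 5]  # arbitrary sample values

prefixes   = [x[:i] for i in range(len(x) + 1)]      # Theta(n) of them
suffixes   = [x[i:] for i in range(len(x) + 1)]      # Theta(n) of them
substrings = [x[i:j] for i in range(len(x) + 1)
                     for j in range(i, len(x) + 1)]  # Theta(n^2) of them
```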
  • 00:43:55
    And these are great, because they're polynomial.
  • 00:43:57
    If I have n things--
  • 00:43:59
    if the length of my sequence, x, is n,
  • 00:44:02
    then there are n prefixes-- technically n plus 1.
  • 00:44:05
    So let's do theta n prefixes.
  • 00:44:08
    There are theta n suffixes.
  • 00:44:10
    And there are theta n squared substrings
  • 00:44:14
    because there's n-- roughly n choices for i and j separately.
  • 00:44:19
    Sorry?
  • 00:44:21
    Sub-sequences.
  • 00:44:21
    Good.
  • 00:44:22
    Right.
  • 00:44:22
    I didn't write sub-sequences, because in fact, there
  • 00:44:24
    are exponentially many subsequences.
  • 00:44:26
    It's 2 to the n.
  • 00:44:27
    For every item, I could choose it or not.
  • 00:44:29
    So I don't want to parameterize--
  • 00:44:31
    I don't want my sub-problems to be subsequences, because then
  • 00:44:35
    you're guaranteed
  • 00:44:37
    to get an exponential number of sub-problems, which is bad.
  • 00:44:40
    We'd like to bound the number of sub-problems by a polynomial.
  • 00:44:43
    So these are three natural ways to get polynomial bounds.
  • 00:44:47
    Now, prefixes and suffixes are obviously better
  • 00:44:50
    because there are fewer of them, linear instead of quadratic.
  • 00:44:53
    And usually, for almost every problem you encounter,
  • 00:44:56
    prefixes and suffixes are equally good.
  • 00:44:58
    It doesn't really matter which one you choose.
  • 00:45:00
    So maybe you'd like to think about which to use--
  • 00:45:03
    well, we'll get to that--
  • 00:45:04
  • 00:45:06
    just choose whichever is more comfortable for you.
  • 00:45:11
    But sometimes it's not enough.
  • 00:45:12
    And we'll have to go to substrings.
  • 00:45:13
    That won't be for another lecture or two.
  • 00:45:16
    Today I claim that prefixes or suffixes
  • 00:45:19
    are enough to solve the bowling problem.
  • 00:45:24
    So what we're going to do is think about--
  • 00:45:26
    I prefer suffixes usually, because I
  • 00:45:29
    like to work from left to right, from the beginning to the end.
  • 00:45:32
    So we're going to think of a suffix of the bowling pins.
  • 00:45:37
    And so what is the sub-problem on a suffix?
  • 00:45:39
    Well, a natural version is just to solve the original problem,
  • 00:45:43
    bowling.
  • 00:45:44
    How do I maximize my score if all I were given
  • 00:45:46
    were these pins?
  • 00:45:47
    Suppose the pins to the left of i didn't exist.
  • 00:45:50
    How would I maximize my score on the remaining pins?
  • 00:45:53
    Or for this suffix, given these four pins, what would I do?
  • 00:45:56
    And there are some weird sub-problems here.
  • 00:45:58
    If I just gave you the last pin, what would you do?
  • 00:46:00
    Nothing.
  • 00:46:01
    That's clearly different from what I would do globally here.
  • 00:46:04
    But I claim if I can solve all suffixes
  • 00:46:06
    I can solve my original problem, because one of the suffixes
  • 00:46:09
    is the whole sequence.
  • 00:46:11
  • 00:46:14
    So let's do it.
  • 00:46:15
  • 00:46:18
    SRTBOT for bowling.
  • 00:46:22
  • 00:46:26
    So here is our dynamic program.
  • 00:46:29
    The sub-problems are suffixes.
  • 00:46:34
    So I'll write b of i is the maximum score we could get
  • 00:46:42
  • 00:46:50
    if we started a game with pins i, i plus 1, up to n minus 1,
  • 00:47:00
    which is a suffix of the pins.
  • 00:47:02
    Very important whenever you write a dynamic program
  • 00:47:04
    to define what your sub-problems are.
  • 00:47:06
    Don't just say how to compute them,
  • 00:47:08
    but first say what is the goal of the sub problem.
  • 00:47:10
    This is a common mistake to forget to state
  • 00:47:13
    what you're trying to do.
  • 00:47:15
    So now I have defined b of i.
  • 00:47:18
    Now, what is the original thing I'm trying to solve?
  • 00:47:23
    In SRTBOT you could also
  • 00:47:24
    put the O earlier; then it actually spells SORT.
  • 00:47:28
    So why don't I do that for fun.
  • 00:47:30
    The original problem we're trying to solve is b of 0,
  • 00:47:35
    because that is all of the pins.
  • 00:47:36
    The suffix starting at 0 is everything.
  • 00:47:39
    So if we can solve that, we're done.
  • 00:47:42
    Next is r for relate.
  • 00:47:46
    The test of whether I got the sub-problems right
  • 00:47:49
    is whether I can write a recurrence relation.
  • 00:47:53
    So let's try to do it.
  • 00:47:54
    We want to compute b of i.
  • 00:47:58
    So we have pin i here and then the remaining pins.
  • 00:48:04
  • 00:48:07
    And the big idea here is to just think about--
  • 00:48:10
    the nice thing about suffixes is if I take off
  • 00:48:13
    something from the beginning, I still have a suffix.
  • 00:48:15
    Remember, my goal is to take this sub-problem, which
  • 00:48:18
    is suffix starting at i, and reduce it
  • 00:48:20
    to a smaller sub problem, which means a smaller suffix.
  • 00:48:23
    So I'd like to clip off one or two items here.
  • 00:48:28
    And then the remaining problem will be one of my sub problems.
  • 00:48:33
    I'll be able to recursively call b of something smaller than i--
  • 00:48:37
    or sorry, b of something larger than i
  • 00:48:39
    will be a smaller suffix, because we're starting later.
  • 00:48:43
    OK, so what could I do?
  • 00:48:44
    Well, the idea is to just look at pin i
  • 00:48:47
    and think, well, what could I do to pin i?
  • 00:48:50
    I could not hit it ever with a ball.
  • 00:48:52
    I could skip it.
  • 00:48:53
    That's one option.
  • 00:48:54
    What would be my score then?
  • 00:48:57
    Well, if I skip pin i, that leaves the remaining pins,
  • 00:49:02
    which is just a smaller suffix.
  • 00:49:04
    So that is b of i plus 1.
  • 00:49:08
    I'm going to write a max out here
  • 00:49:10
    because I'd like to maximize my score.
  • 00:49:12
    And one of the options is, forget about pin i.
  • 00:49:14
    Just solve the rest.
  • 00:49:16
    Another option is I throw a ball.
  • 00:49:18
    And I exactly hit pin i.
  • 00:49:21
    That's one thing I could do.
  • 00:49:22
    And it would leave exactly the same remainder.
  • 00:49:26
    So another option is b of i plus 1 plus vi.
  • 00:49:34
    Why would I prefer skipping over hitting?
  • 00:49:36
    Well, if vi is negative, I'd prefer to skip.
  • 00:49:39
    But if vi is positive, I'd actually prefer hitting over skipping.
  • 00:49:42
    So you can figure out which is better, just locally.
  • 00:49:47
    But then there's another thing I can do,
  • 00:49:49
    which is maybe I hit this pin in a pair with some other pin.
  • 00:49:54
    Now, there's no pin to the left of this one.
  • 00:49:56
    We're assuming we only have the suffix.
  • 00:49:58
    And so the only other thing I can do is throw a ball
  • 00:50:01
    and hit i together with i plus 1.
  • 00:50:03
    And then I get the product.
  • 00:50:05
    Now, what pins remain?
  • 00:50:07
    i plus 2 on.
  • 00:50:08
    Still a suffix.
  • 00:50:09
    So if I remove one or two items, of course,
  • 00:50:12
    I still get a suffix--
  • 00:50:13
    in this case, b of i plus 2--
  • 00:50:15
    and then the number of points that I add on
  • 00:50:18
    are vi times vi plus 1.
  • 00:50:21
    So this is a max of three things.
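Collected into one line (a code-style reconstruction, not verbatim from the board), the relation reads:

```
b(i) = max( b(i+1),                    # option 1: skip pin i
            v[i] + b(i+1),             # option 2: hit pin i by itself
            v[i] * v[i+1] + b(i+2) )   # option 3: hit pins i and i+1 together
```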
  • 00:50:26
    So how long does it take me to compute it?
  • 00:50:28
    I claim constant time.
  • 00:50:30
    If I don't count the time it takes
  • 00:50:31
    to compute these other sub problems, which
  • 00:50:33
    are smaller because they are smaller suffixes further
  • 00:50:37
    to the right, then I'm doing a couple
  • 00:50:40
    of additions-- product, max.
  • 00:50:42
    These are all nice numbers and I'll
  • 00:50:44
    assume that they live in the w-bit word,
  • 00:50:48
    because we're only doing constant sized products.
  • 00:50:50
    That's good.
  • 00:50:51
    So this takes constant non-recursive work.
  • 00:50:56
    How many sub-problems are there?
  • 00:50:57
    Well, it's suffixes, so it's a linear number of sub problems.
  • 00:51:01
    And so the time I'm going to end up
  • 00:51:03
    needing is number of sub problems, n,
  • 00:51:07
    times the non-recursive work I do per sub problem, which
  • 00:51:11
    is constant.
  • 00:51:12
    And so this is linear time.
  • 00:51:16
    Great.
  • 00:51:16
    And I didn't finish SRTBOT, so there's
  • 00:51:19
    another t, which is to make sure that there
  • 00:51:21
    is a topological order and that is in decreasing i order.
  • 00:51:27
  • 00:51:30
    Or I might write that as a for loop--
  • 00:51:33
    for i equals n, n minus 1, down to 0.
  • 00:51:37
    This is the order that I would compute my problems
  • 00:51:41
    because the suffix starting at n is the empty suffix.
  • 00:51:44
    The suffix starting at 0, that's the one I actually
  • 00:51:46
    want to compute.
  • 00:51:47
    That's the final suffix I should be computing.
  • 00:51:50
    And then we have a b for base case,
  • 00:51:53
    which is that first case, b of n equals 0,
  • 00:52:00
    because there's no pins.
  • 00:52:02
    So I don't get any points.
  • 00:52:04
    Sad.
  • 00:52:04
  • 00:52:07
    OK, so this is it.
  • 00:52:09
    We just take these components, plug them
  • 00:52:12
    into this recursive, memoized algorithm,
  • 00:52:14
    and we have a linear time algorithm.
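As a sketch of how those components might plug together in Python (the function name and details are mine, not from the lecture), here is the top-down memoized version:

```python
from functools import lru_cache

def bowling_top_down(v):
    """Maximum bowling score on pins with values v, via recursion + memoization.

    Subproblem: b(i) = max score achievable using only pins i, i+1, ..., n-1.
    """
    n = len(v)

    @lru_cache(maxsize=None)                 # memoization: solve each suffix once
    def b(i):
        if i >= n:                           # base case: no pins left, no points
            return 0
        best = b(i + 1)                      # option 1: skip pin i
        best = max(best, b(i + 1) + v[i])    # option 2: hit pin i alone
        if i + 1 < n:                        # option 3: hit pins i and i+1 together
            best = max(best, b(i + 2) + v[i] * v[i + 1])
        return best

    return b(0)                              # original problem: the full suffix
```

Note that on very long rows of pins this recursion can run into Python's default recursion limit; the bottom-up version described next avoids that.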
  • 00:52:17
    I want to briefly mention a different way
  • 00:52:18
    you could plug together those pieces, which is called bottom
  • 00:52:21
    up dp, which is--
  • 00:52:27
    let's do it for this example.
  • 00:52:29
    So if I have--
  • 00:52:32
    let's see.
  • 00:52:34
    Let me start with the base case, b of n equals 0.
  • 00:52:39
    But now it's an assignment.
  • 00:52:41
    And I'm going to do a for loop from the topological order,
  • 00:52:44
    for i equals n, n minus 1 to 0.
  • 00:52:49
    Now I'm going to do the relation, b of i equals
  • 00:52:53
    max of b of i plus 1, and b of i plus 1 plus vi,
  • 00:53:02
    and b of i plus 2 plus vi times vi plus 1.
  • 00:53:08
    Technically this only works if i is strictly less than n
  • 00:53:11
    minus 1.
  • 00:53:12
    So I should have an "if i is less than n minus 1" for that last part
  • 00:53:17
    because I can only do--
  • 00:53:18
    I can only hit two pins if there are at least two pins left.
  • 00:53:22
    And then return b of 0.
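That loop, written out as runnable Python (again a sketch with my own naming, following the steps just described):

```python
def bowling_bottom_up(v):
    """Same DP, bottom-up: base case, then the relation in topological order."""
    n = len(v)
    b = [0] * (n + 1)                   # base case: b[n] = 0 (no pins left)
    for i in range(n - 1, -1, -1):      # topological order: decreasing i
        b[i] = max(b[i + 1],            # skip pin i
                   b[i + 1] + v[i])     # hit pin i alone
        if i < n - 1:                   # pairing requires two remaining pins
            b[i] = max(b[i], b[i + 2] + v[i] * v[i + 1])
    return b[0]                         # original problem: all the pins
```

For example, bowling_bottom_up([1, 1]) returns 2, matching the earlier observation that hitting two 1-pins individually beats taking their product.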
  • 00:53:27
    So what I just did is a transformation
  • 00:53:29
    from this SRTBOT template into a non-recursive algorithm, a
  • 00:53:35
    for loop algorithm, where I wrote my base case first.
  • 00:53:39
    Then I did my topological order.
  • 00:53:43
    Then I did my relation.
  • 00:53:46
    Then at the end, I did my--
  • 00:53:48
    not base case.
  • 00:53:49
    The original problem.
  • 00:53:50
  • 00:53:53
    And provided you can write your topological order
  • 00:53:56
    as some for loops,
  • 00:53:57
    this is actually a great way to write down a DP as code.
  • 00:54:00
    If I were going to implement this algorithm,
  • 00:54:02
    I would write it this way, because this is super fast.
  • 00:54:04
    No recursive calls.
  • 00:54:05
    Just one for loop.
  • 00:54:06
    In fact, this is almost a trivial algorithm.
  • 00:54:08
    It's amazing that this solves the bowling problem.
  • 00:54:12
    It's in some sense considering every possible strategy I could use
  • 00:54:16
    for bowling these pins.
  • 00:54:19
    What we're using is what we like to call local brute force,
  • 00:54:22
    where when we think about pin i, we
  • 00:54:25
    look at all of the possible things I could do to pin i,
  • 00:54:28
    and here there are really only three options of what I could do.
  • 00:54:31
    Now, normally, if I tried all the options for pin i
  • 00:54:34
    and then all the options for i plus 1 and i plus 2 and so on,
  • 00:54:37
    that would be exponential.
  • 00:54:38
    It'd be 3 times 3 times 3.
  • 00:54:40
    That's bad, but because I can reuse these sub problems,
  • 00:54:44
    it turns out to only be linear time.
  • 00:54:47
    It's almost like magic.
  • 00:54:49
    DP is essentially this idea of using local brute force.
  • 00:54:56
  • 00:55:02
    And by defining a small number of sub-problems up front--
  • 00:55:06
    and as long as I stay within those sub problems,
  • 00:55:09
    as long as I'm always recursing into this polynomial space,
  • 00:55:12
    I end up only doing polynomial work,
  • 00:55:15
    even though I'm in some sense exploring
  • 00:55:17
    exponentially many options.
  • 00:55:19
    And it is because what I do to this pin
  • 00:55:22
    doesn't depend too much on what I do to a pin much later.
  • 00:55:26
    There's a lot of intuition going on here for
  • 00:55:30
    when DP works.
  • 00:55:31
    But we're going to see a lot more examples of that
  • 00:55:33
    coming up.
  • 00:55:36
    And I just want to mention the intuition for how
  • 00:55:38
    to write a recurrence like this is
  • 00:55:40
    to think about-- in the case of suffixes,
  • 00:55:42
    you always want to think about the first item, or maybe
  • 00:55:44
    the first couple of items.
  • 00:55:46
    In the case of prefixes, you always think about the last item.
  • 00:55:49
    And for substrings, it could be any item-- maybe in the middle.
  • 00:55:52
    If I remove an item from the middle of a substring,
  • 00:55:54
    I get two substrings, so I can recurse.
  • 00:55:57
    Here or in general, what we want to do
  • 00:56:00
    is identify some feature of the solution
  • 00:56:03
    such that, if we knew that feature, we would be done.
  • 00:56:06
    We would reduce to a smaller sub problem.
  • 00:56:09
    In this case, we just say, well, what
  • 00:56:11
    are the possible things I could do to the first pin?
  • 00:56:15
    There are three options.
  • 00:56:16
    If I knew which option it was, I would be done.
  • 00:56:18
    I could recurse and do my addition.
  • 00:56:21
    Now, I don't know which thing I want to do.
  • 00:56:23
    So I just try them all and take the max.
  • 00:56:26
    And if you're maximizing, you take the max.
  • 00:56:28
    If you're minimizing, you take the min.
  • 00:56:29
    Sometimes you take an OR or an AND.
  • 00:56:32
    There might be some combination function.
  • 00:56:34
    For optimization problems where you're
  • 00:56:36
    trying to maximize or minimize something,
  • 00:56:37
    like shortest paths, where you're trying to minimize,
  • 00:56:39
    we put them in here.
  • 00:56:40
    So usually it's min or max.
  • 00:56:42
    And this is extremely powerful.
  • 00:56:46
    All you need to do--
  • 00:56:47
    the hard part is this inspired design part
  • 00:56:50
    where you say, what do I need to know that would
  • 00:56:54
    let me solve my problem?
  • 00:56:56
    And if you can identify that and the number of choices
  • 00:56:58
    for what you need to know is polynomial,
  • 00:57:01
    then you will be able to get a polynomial dynamic program.
  • 00:57:05
    That's the intuition.
  • 00:57:06
    You'll see a lot more examples in the next three lectures.
  • 00:57:10
Tags
  • dynamic programming
  • SRTBOT
  • memoization
  • algorithm design
  • Fibonacci
  • shortest paths
  • recursive algorithms
  • problem-solving
  • sub-problems
  • bowling game