docs/Tutorial_Matrix_Profiles_For_Streaming_Data.ipynb
75 additions & 66 deletions
@@ -254,8 +254,8 @@
      "name": "stdout",
      "output_type": "stream",
      "text": [
-      "stumpy.stump: 1022.7s\n",
-      "stumpy.stumpi: 172.0s\n"
+      "stumpy.stump: 1036.3s\n",
+      "stumpy.stumpi: 21.6s\n"
      ]
     }
    ],
@@ -264,7 +264,7 @@
     "T_stream = T_full[:200].copy()\n",
     "m = 100\n",
     "\n",
-    "# # `stumpy.stump` timing\n",
+    "# `stumpy.stump` timing\n",
     "start = time.time()\n",
     "mp = stumpy.stump(T_stream, m)\n",
     "for i in range(200, len(T_full)):\n",
@@ -273,8 +273,8 @@
     "stump_time = time.time() - start\n",
     "\n",
     "# `stumpy.stumpi` timing\n",
-    "start = time.time()\n",
     "stream = stumpy.stumpi(T_stream, m, egress=False) # Don't egress/remove the oldest data point when streaming\n",
+    "start = time.time()\n",
     "for i in range(200, len(T_full)):\n",
     "    t = T_full[i]\n",
     "    stream.update(t)\n",
@@ -288,28 +288,44 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "Setting aside the fact that having more CPUs will speed up both approaches, we clearly see that incremental `stumpy.stumpi` is almost an order of magnitude faster than batch `stumpy.stump` for processing streaming data. In fact for the current hardware, on average, it is taking roughly 0.1 seconds for `stumpy.stump` to analyze each new matrix profile. So, if you have more than 10 new data point arriving every second, then you wouldn't be able to keep up. In contrast, `stumpy.stumpi` should be able to comfortably handle and process ~50+ new data points per second using fairly modest hardware. Additionally, batch `stumpy.stump`, which has a computational complexity of `O(n^2)`, will get even slower as more and more data points get appended to the existing time series while `stumpy.stumpi`, which is essentially `O(1)`, will continue to be highly performant. \n",
+    "Setting aside the fact that having more CPUs will speed up both approaches, we clearly see that incremental `stumpy.stumpi` is one to two orders of magnitude faster than batch `stumpy.stump` for processing streaming data. In fact, for the current hardware, on average, it is taking roughly 0.1 seconds for `stumpy.stump` to analyze each new matrix profile. So, if you have more than 10 new data points arriving every second, then you wouldn't be able to keep up. In contrast, `stumpy.stumpi` should be able to comfortably handle and process ~450+ new data points per second using fairly modest hardware. Additionally, batch `stumpy.stump`, which has a computational complexity of `O(n^2)`, will get even slower as more and more data points get appended to the existing time series, while `stumpy.stumpi`, which is essentially `O(1)`, will continue to be highly performant.\n",
     "\n",
-    "In fact, if you <u><b>don't</b></u> care about maintaining the oldest data point and its relationships with the newest data point (i.e., you only care about maintaining a fixed sized sliding window), then you can get even better performance by telling `stumpy.stumpi` to remove/egress the oldest data point (along with its corresponding matrix profile information) by setting the parameter `egress=True` (note that this is actually the default behavior):"
+    "In fact, if you <u><b>don't</b></u> care about maintaining the oldest data point and its relationships with the newest data point (i.e., you only care about maintaining a fixed-sized sliding window), then you can slightly improve the performance by telling `stumpy.stumpi` to remove/egress the oldest data point (along with its corresponding matrix profile information) by setting the parameter `egress=True` when we instantiate our streaming object (note that this is actually the default behavior):"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 16,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "stream = stumpy.stumpi(T_stream, m, egress=True) # Egressing/removing the oldest data point is the default behavior!"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "And now, when we process the same data above:"
    ]
   },
   {
    "cell_type": "code",
-   "execution_count": 13,
+   "execution_count": 17,
    "metadata": {},
    "outputs": [
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
-      "stumpy.stumpi: 125.6s\n"
+      "stumpy.stumpi: 13.3s\n"
      ]
     }
    ],
    "source": [
-    "# `stumpy.stumpi` timing\n",
+    "# `stumpy.stumpi` timing with egress\n",
+    "stream = stumpy.stumpi(T_stream, m, egress=True)\n",
     "start = time.time()\n",
-    "stream = stumpy.stumpi(T_stream, m, egress=True) # This is actually the default behavior in `stumpy.stumpi`\n",