{ "cells": [ { "cell_type": "markdown", "id": "2d17e9c1", "metadata": {}, "source": [ "# Hazard sessions\n", "\n", "To practice data transformation, we use data from hazard (gambling) games.\n", "\n", "## Data\n", "Download *hazard.zip*, unpack it, and read the two files:\n", "\n", "* ***hra-upr.csv***: (BIG!) table of individual games played by a gambler, sorted by gambler id and time\n", " - *df_v_misto_key*: place id\n", " - *df_v_konto_key*: gambler id\n", " - *df_v_herni_pozice_key*: machine id\n", " - *sazkavysepuvodni*: bet amount (in CZK)\n", " - *sazkaprijeticas*: exact date and time of playing a game\n", " - *vyhravysepuvodni*: winnings amount (in CZK)\n", " - *df_v_evidence_her_key*: game id\n", " - *zmena_konto_misto*: flag set when the current record has a different gambler or place id than the previous one (change of place or start of another gambler's records)\n", " - *cas_pred*: time difference (in seconds) from the previous record's *sazkaprijeticas*\n", "* ***misto-upr.csv***: table of places\n", " - *df_v_misto_key*: place id\n", " - *jtsk_x*, *jtsk_y*: coordinates in the S-JTSK planar coordinate system (different from GPS)\n", " - *obec*, *ulice*, *psc*, *cp*: address of the place\n", " - *kraj*: region of the Czech Republic\n", " - *typmisto*: place type\n", " - *sidlokodruian*: address code in the RUIAN register" ] }, { "cell_type": "code", "execution_count": 32, "id": "9ed66ea3", "metadata": {}, "outputs": [], "source": [ "### Setup\n", "%matplotlib inline\n", "# enables inline plotting without an explicit call to .show()\n", "\n", "# Import libraries\n", "import pandas as pd\n", "import numpy as np\n", "import seaborn as sns\n", "import seaborn.objects as so\n", "import matplotlib.pyplot as plt" ] }, { "cell_type": "code", "execution_count": 33, "id": "90ec2dbc", "metadata": {}, "outputs": [ { "data": { "text/html": [ "
 | df_v_misto_key | df_v_konto_key | df_v_herni_pozice_key | sazkavysepuvodni | sazkaprijeticas | vyhravysepuvodni | df_v_evidence_her_key | zmena_konto_misto | cas_pred
---|---|---|---|---|---|---|---|---|---
0 | 127018397 | 31347675075 | 157656900 | 100 | 2023-12-18T15:16:17Z | 0 | 507989000 | True | NaN
1 | 127018397 | 31347675075 | 157656900 | 100 | 2023-12-18T15:16:19.600Z | 0 | 507989000 | False | 2.6
2 | 127018397 | 31347675075 | 157656900 | 100 | 2023-12-18T15:16:22.100Z | 0 | 507989000 | False | 2.5
3 | 127018397 | 31347675075 | 157656900 | 100 | 2023-12-18T15:16:24.600Z | 0 | 507989000 | False | 2.5
4 | 127018397 | 31347675075 | 157656900 | 100 | 2023-12-18T15:16:27.100Z | 0 | 507989000 | False | 2.5
... | ... | ... | ... | ... | ... | ... | ... | ... | ...
7420165 | 277179595 | 79816767890 | 160812221 | 5 | 2023-12-31T18:02:37Z | 0 | 420825383 | False | 2.0
7420166 | 277179595 | 79816767890 | 160812221 | 5 | 2023-12-31T18:02:40Z | 0 | 420825383 | False | 3.0
7420167 | 277179595 | 79816767890 | 160812221 | 5 | 2023-12-31T18:02:42Z | 0 | 420825383 | False | 2.0
7420168 | 277179595 | 79816767890 | 160812221 | 5 | 2023-12-31T18:02:44Z | 0 | 420825383 | False | 2.0
7420169 | 277179595 | 79816767890 | 160812221 | 5 | 2023-12-31T18:02:46Z | 0 | 420825383 | False | 2.0

7420170 rows × 9 columns

 | df_v_misto_key | typmisto | sidlokodruian | jtsk_y | jtsk_x | kraj | obec | psc | ulice | cp
---|---|---|---|---|---|---|---|---|---|---
0 | 145526315 | H | NaN | NaN | NaN | LBK | Semily | 51301 | Komenského nám. | 119
1 | 258252794 | H | NaN | NaN | NaN | JHM | Vyškov | 68201 | Masarykovo náměstí | 39
2 | 271154701 | H | NaN | NaN | NaN | ZLK | Rožnov pod Radhoštěm | 75661 | Svazarmovská | 1682
3 | 130950653 | H | 951749.0 | 824561.29 | 1091140.86 | PLK | Stod | 33301 | nám. ČSA | 72
4 | 130950654 | H | 1498967.0 | 763128.56 | 1049512.75 | STC | Loděnice | 26712 | Plzeňská | 44
5 | 127018394 | H | 2771217.0 | 517707.49 | 1082628.66 | MSK | Úvalno | 79391 | Úvalno | 246
6 | 130950651 | H | 5541751.0 | 773463.88 | 1126872.63 | JHC | Mirovice | 39806 | Masarykovo náměstí | 44
7 | 130950656 | H | 5918901.0 | 783349.25 | 1061108.00 | STC | Žebrák | 26753 | Náměstí | 11
8 | 130950647 | H | 7655762.0 | 632009.89 | 1058808.04 | PAK | Pardubice | 53002 | Teplého | 1375
9 | 127018395 | H | 11545755.0 | 700196.47 | 1071764.05 | STC | Kutná Hora | 28401 | Václavské náměstí | 177
10 | 127018397 | H | 18167829.0 | 464086.17 | 1103583.85 | STC | Lysá nad Labem | 28922 | Smetanova | 789
11 | 130950652 | H | 20427751.0 | NaN | NaN | OLK | Javorník | 79070 | nám. Svobody | 415
12 | 130950649 | H | 23638907.0 | NaN | NaN | JHM | Zbýšov | 66411 | Masarykova | 54
13 | 277179595 | H | 26321823.0 | 756748.40 | 989637.48 | ULK | Litoměřice | 41201 | Pokratická | 1851
14 | 127018396 | H | 27911420.0 | 719473.70 | 949421.61 | JHM | Hustopeče | 69301 | Dukelské nám. | 1326