<!DOCTYPE html>
<!--[if IE 8]><html class="no-js lt-ie9" lang="en" > <![endif]-->
<!--[if gt IE 8]><!--> <html class="no-js" lang="en" > <!--<![endif]-->
<head>
<meta charset="utf-8">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>Data sets — MultiCategory 0.0.1 documentation</title>
<script type="text/javascript" src="_static/js/modernizr.min.js"></script>
<script type="text/javascript" id="documentation_options" data-url_root="./" src="_static/documentation_options.js"></script>
<script src="_static/jquery.js"></script>
<script src="_static/underscore.js"></script>
<script src="_static/doctools.js"></script>
<script src="_static/language_data.js"></script>
<script type="text/javascript" src="_static/js/theme.js"></script>
<link rel="stylesheet" href="_static/css/theme.css" type="text/css" />
<link rel="stylesheet" href="_static/pygments.css" type="text/css" />
<link rel="index" title="Index" href="genindex.html" />
<link rel="search" title="Search" href="search.html" />
<link rel="next" title="Reference" href="reference.html" />
<link rel="prev" title="Tutorial" href="tutorial.html" />
</head>
<body class="wy-body-for-nav">
<div class="wy-grid-for-nav">
<nav data-toggle="wy-nav-shift" class="wy-nav-side">
<div class="wy-side-scroll">
<div class="wy-side-nav-search" >
<a href="index.html" class="icon icon-home"> MultiCategory
</a>
<div role="search">
<form id="rtd-search-form" class="wy-form" action="search.html" method="get">
<input type="text" name="q" placeholder="Search docs" />
<input type="hidden" name="check_keywords" value="yes" />
<input type="hidden" name="area" value="default" />
</form>
</div>
</div>
<div class="wy-menu wy-menu-vertical" data-spy="affix" role="navigation" aria-label="main navigation">
<p class="caption"><span class="caption-text">Contents:</span></p>
<ul class="current">
<li class="toctree-l1"><a class="reference internal" href="install.html">Install</a></li>
<li class="toctree-l1"><a class="reference internal" href="tutorial.html">Tutorial</a></li>
<li class="toctree-l1 current"><a class="current reference internal" href="#">Data sets</a><ul>
<li class="toctree-l2"><a class="reference internal" href="#simple-e-commerce-data-sets">Simple E-commerce data sets</a></li>
<li class="toctree-l2"><a class="reference internal" href="#data-sets-from-helsinki-multi-model-data-repository">Data sets from Helsinki multi-model data repository</a></li>
<li class="toctree-l2"><a class="reference internal" href="#unibench">Unibench</a></li>
<li class="toctree-l2"><a class="reference internal" href="#online-market-place-under-development">Online Market Place (under development)</a><ul class="simple">
</ul>
</li>
</ul>
</li>
<li class="toctree-l1"><a class="reference internal" href="reference.html">Reference</a></li>
<li class="toctree-l1"><a class="reference internal" href="theoretical_background.html">Theoretical background</a></li>
<li class="toctree-l1"><a class="reference internal" href="license.html">License</a></li>
<li class="toctree-l1"><a class="reference internal" href="citing.html">Citing</a></li>
<li class="toctree-l1"><a class="reference internal" href="bibliography.html">Bibliography</a></li>
</ul>
</div>
</div>
</nav>
<section data-toggle="wy-nav-shift" class="wy-nav-content-wrap">
<nav class="wy-nav-top" aria-label="top navigation">
<i data-toggle="wy-nav-top" class="fa fa-bars"></i>
<a href="index.html">MultiCategory</a>
</nav>
<div class="wy-nav-content">
<div class="rst-content">
<div role="navigation" aria-label="breadcrumbs navigation">
<ul class="wy-breadcrumbs">
<li><a href="index.html">Docs</a> »</li>
<li>Data sets</li>
<li class="wy-breadcrumbs-aside">
<a href="_sources/data_sets.rst.txt" rel="nofollow"> View page source</a>
</li>
</ul>
<hr/>
</div>
<div role="main" class="document" itemscope="itemscope" itemtype="http://schema.org/Article">
<div itemprop="articleBody">
<div class="section" id="data-sets">
<h1>Data sets<a class="headerlink" href="#data-sets" title="Permalink to this headline">¶</a></h1>
<p>MultiCategory comes with six multi-model data sets: the simple e-commerce data set, Patent, Film, University, Person, and Unibench.</p>
<p>The Haskell version of MultiCategory ships with all of the data sets, so downloading it gives you every data set at once. Note that these data sets have been modified and cleaned, because Haskell's strict typing requires the data to conform exactly to the declared types.
We have also made the data sets smaller, because the original goal was not to build a big data platform.</p>
<p>The Python version, on the other hand, does not include most of the data. It is designed to support larger data sets, so all the data sets can be uploaded into the demo system as they are. You can read more about installing the data sets on the <a class="reference external" href="install.html">Install</a> page.</p>
<div class="section" id="simple-e-commerce-data-sets">
<h2>Simple E-commerce data sets<a class="headerlink" href="#simple-e-commerce-data-sets" title="Permalink to this headline">¶</a></h2>
<p>This data set is a very small e-commerce data set. It contains</p>
<ul class="simple">
<li><p>a property graph of customer –knows–> customer,</p></li>
<li><p>a table of locations and</p></li>
<li><p>an XML document of Orders and Products.</p></li>
</ul>
<p>The data set suffices for demonstrating category-theoretic ideas, but more comprehensive data sets motivated by real-life use cases have been implemented or are under implementation.</p>
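<p>To make the three models concrete, the following is a minimal Python sketch of how a property graph, a table, and an XML document of this shape could be held side by side. It is not MultiCategory's own representation: every customer name, city, order, and price is invented for illustration, and the helper functions order_totals_of_known_customers and city_of are hypothetical.</p>

```python
import xml.etree.ElementTree as ET

# All values below are invented for illustration; they are not taken
# from the data set that ships with MultiCategory.

# Property graph (customer knows customer) as an adjacency mapping.
knows = {
    "alice": ["bob"],
    "bob": ["carol"],
    "carol": [],
}

# Relational table of locations: one row per customer.
locations = [
    {"customer": "alice", "city": "Helsinki"},
    {"customer": "bob", "city": "Espoo"},
    {"customer": "carol", "city": "Helsinki"},
]

# XML document of orders and products, built with ElementTree.
orders = ET.Element("Orders")
order = ET.SubElement(orders, "Order", {"id": "o1", "customer": "bob"})
ET.SubElement(order, "Product", {"name": "book", "price": "12.5"})
ET.SubElement(order, "Product", {"name": "pen", "price": "2.0"})


def order_totals_of_known_customers(customer):
    """Total order value for every customer that `customer` knows."""
    totals = {}
    for friend in knows.get(customer, []):
        total = 0.0
        for o in orders.findall("Order"):
            if o.get("customer") == friend:
                total += sum(float(p.get("price")) for p in o.findall("Product"))
        totals[friend] = total
    return totals


def city_of(customer):
    """Look up a customer's city in the locations table."""
    for row in locations:
        if row["customer"] == customer:
            return row["city"]
    return None
```

<p>Here a single question ("how much have the people Alice knows ordered?") already touches the graph and the XML document, and adding the city requires the table as well. This is the kind of cross-model access the data set is meant to exercise.</p>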
</div>
<div class="section" id="data-sets-from-helsinki-multi-model-data-repository">
<h2>Data sets from Helsinki multi-model data repository<a class="headerlink" href="#data-sets-from-helsinki-multi-model-data-repository" title="Permalink to this headline">¶</a></h2>
<p>You can find schemas and detailed descriptions in the Helsinki multi-model data repository <a class="bibtex reference internal" href="bibliography.html#udms-dataset" id="id1">[LCZ18]</a>.</p>
</div>
<div class="section" id="unibench">
<h2>Unibench<a class="headerlink" href="#unibench" title="Permalink to this headline">¶</a></h2>
<p>Unibench <a class="bibtex reference internal" href="bibliography.html#id4" id="id2">[ZLXC19]</a> is a benchmark for multi-model databases. It comes with a data set that has been slightly scaled for this demo.</p>
</div>
<div class="section" id="online-market-place-under-development">
<h2>Online Market Place (under development)<a class="headerlink" href="#online-market-place-under-development" title="Permalink to this headline">¶</a></h2>
<p>This data set is an expanded version of the simple e-commerce data set. It models an online market place such as Amazon, eBay, or Alibaba. Such a market place brings different sellers together, which requires novel techniques for combining
different data models. Currently, this demo data set has three third-party sellers, which use relational, JSON, and XML models and formats to store their product and invoice information. The online market place implements a multi-model database
based on category theory, which demonstrates how multi-model query processing and multi-model joins can be applied in this scenario.</p>
<p>The online market place demonstration scenario includes:</p>
<ul class="simple">
<li><p>a property graph of customer – friends with –> customer (Facebook friendship graph from <a class="bibtex reference internal" href="bibliography.html#snapnets" id="id3">[LK14]</a>),</p></li>
<li><p>separate RDF graphs of capitals and countries (<a class="reference external" href="http://telegraphis.net/data/">Telegraphis</a>), and</p></li>
<li><p>product and invoice information for each third-party seller, stored in relational tables, JSON objects, or XML trees. The contents of the data vary from seller to seller. (Amazon product metadata from <a class="bibtex reference internal" href="bibliography.html#snapnets" id="id4">[LK14]</a>)</p></li>
</ul>
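<p>As a rough illustration of why combining the three sellers needs extra machinery, the sketch below normalises product records from a relational, a JSON, and an XML source into one list using plain Python. It does not use category theory and is not how MultiCategory performs multi-model joins. The product records, field names, and functions (from_relational, from_json, from_xml, unified_catalogue, cheapest) are all assumptions made for this example.</p>

```python
import json
import xml.etree.ElementTree as ET

# Hypothetical sample data: one seller per data model.
relational_rows = [("p1", "kettle", 25.0)]  # (id, name, price) tuples
json_text = '[{"id": "p2", "name": "mug", "price": 5.0}]'
xml_root = ET.Element("products")
ET.SubElement(xml_root, "product", {"id": "p3", "name": "spoon", "price": "1.5"})


def from_relational(rows):
    """Turn (id, name, price) tuples into uniform product records."""
    return [
        {"id": pid, "name": name, "price": price, "model": "relational"}
        for pid, name, price in rows
    ]


def from_json(text):
    """Parse a JSON array of product objects and tag each record."""
    return [dict(obj, model="json") for obj in json.loads(text)]


def from_xml(root):
    """Read product elements; XML attributes are strings, so convert price."""
    return [
        {
            "id": e.get("id"),
            "name": e.get("name"),
            "price": float(e.get("price")),
            "model": "xml",
        }
        for e in root.findall("product")
    ]


def unified_catalogue():
    """Concatenate the three sellers' products into one list of records."""
    return (
        from_relational(relational_rows)
        + from_json(json_text)
        + from_xml(xml_root)
    )


def cheapest(catalogue):
    """Return the record with the lowest price across all sellers."""
    return min(catalogue, key=lambda item: item["price"])
```

<p>Each source needs its own hand-written adapter, and a query such as cheapest only works once all records share one shape. A multi-model database aims to make this kind of cross-model query possible without per-source glue code.</p>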
<div class="toctree-wrapper compound">
</div>
</div>
</div>
</div>
</div>
<footer>
<div class="rst-footer-buttons" role="navigation" aria-label="footer navigation">
<a href="reference.html" class="btn btn-neutral float-right" title="Reference" accesskey="n" rel="next">Next <span class="fa fa-arrow-circle-right"></span></a>
<a href="tutorial.html" class="btn btn-neutral float-left" title="Tutorial" accesskey="p" rel="prev"><span class="fa fa-arrow-circle-left"></span> Previous</a>
</div>
<hr/>
<div role="contentinfo">
<p>
© Copyright 2020, Valter Uotila
</p>
</div>
Built with <a href="http://sphinx-doc.org/">Sphinx</a> using a <a href="https://github.com/rtfd/sphinx_rtd_theme">theme</a> provided by <a href="https://readthedocs.org">Read the Docs</a>.
</footer>
</div>
</div>
</section>
</div>
<script type="text/javascript">
jQuery(function () {
SphinxRtdTheme.Navigation.enable(true);
});
</script>
</body>
</html>