Building a data store with the dynamic structure
Modelirovanie i analiz informacionnyh sistem, Vol. 23 (2016) no. 2, pp. 93-118.

See the article record from the source Math-Net.Ru

This article analyzes approaches to data warehouse construction based on relational and NoSQL solutions and lists the limitations of the relational approach for data mining. It reveals a contradiction between how data appear in a real subject domain and how the relational and NoSQL approaches model them. The contradiction stems from the temporality of individual attribute values, the variability of the set of attributes, and the variability of the structure of connections between them. A new logical model of a data warehouse with a dynamic structure is proposed. The model is based on the concept of an object as a container for storing properties. Each property of an object consists of the property name and two property values, one without a reference and one with a reference, that are relevant at a given time. The reference value points to an object whose name is interpreted as the value of the property at that time. A formal description of the model is given, identifying the functionality needed to manipulate objects and their properties (selectors, predicates, constructors), and the necessary control structures are introduced. The proposed model, called the OP-model, is substantiated by showing its correspondence to the logical ER data model: it is proved that any ER data model can be implemented in the OP-model. The advantages of the OP-model are also indicated; they stem from the possibility of changing connections between entities by changing the reference value at a particular time. The potential for scaling the data warehouse, owing to the unique identification of each object, is noted.
Keywords: NoSQL, Big Data, ER model, Databases, DBMS.
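
The object-and-property structure described in the abstract can be illustrated with a minimal Python sketch. It is based only on the abstract, not on the formal description in the article: the names `Obj`, `set_prop`, `has_prop`, and `get_prop`, the integer time stamps, and the counter used for unique identification are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass, field
from itertools import count
from typing import Dict, List, Optional, Tuple

_next_id = count(1)  # assumption: a counter stands in for unique object identification


@dataclass
class Obj:
    """An object: a named container of time-stamped properties."""
    name: str
    oid: int = field(default_factory=lambda: next(_next_id))
    # property name -> list of (time, value without reference, referenced object)
    props: Dict[str, List[Tuple[int, Optional[str], Optional["Obj"]]]] = field(
        default_factory=dict
    )


def set_prop(obj: Obj, prop: str, time: int,
             value: Optional[str] = None, ref: Optional[Obj] = None) -> None:
    """Constructor (hypothetical): record the property's values as of `time`."""
    history = obj.props.setdefault(prop, [])
    history.append((time, value, ref))
    history.sort(key=lambda entry: entry[0])  # keep the history ordered by time


def has_prop(obj: Obj, prop: str, time: int) -> bool:
    """Predicate (hypothetical): is the property defined at `time`?"""
    return any(t <= time for t, _, _ in obj.props.get(prop, []))


def get_prop(obj: Obj, prop: str, time: int) -> Tuple[Optional[str], Optional[str]]:
    """Selector (hypothetical): values relevant at `time`.

    Returns (value without reference, name of the referenced object).
    """
    current = None
    for t, value, ref in obj.props.get(prop, []):
        if t <= time:
            current = (value, ref.name if ref is not None else None)
    if current is None:
        raise KeyError(f"{prop!r} is not defined for {obj.name!r} at time {time}")
    return current


acme, globex, alice = Obj("Acme"), Obj("Globex"), Obj("Alice")
set_prop(alice, "age", time=1, value="30")
set_prop(alice, "employer", time=1, ref=acme)    # connection at time 1
set_prop(alice, "employer", time=5, ref=globex)  # connection changes at time 5

print(get_prop(alice, "employer", 3))  # (None, 'Acme')
print(get_prop(alice, "employer", 7))  # (None, 'Globex')
```

In this sketch, changing a connection between two entities amounts to recording a new reference value with a later time stamp; earlier states remain queryable through the selector.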
@article{MAIS_2016_23_2_a0,
     author = {Yu. N. Artamonov},
     title = {Building a data store with the dynamic structure},
     journal = {Modelirovanie i analiz informacionnyh sistem},
     pages = {93--118},
     publisher = {mathdoc},
     volume = {23},
     number = {2},
     year = {2016},
     language = {ru},
     url = {http://geodesic.mathdoc.fr/item/MAIS_2016_23_2_a0/}
}
TY  - JOUR
AU  - Yu. N. Artamonov
TI  - Building a data store with the dynamic structure
JO  - Modelirovanie i analiz informacionnyh sistem
PY  - 2016
SP  - 93
EP  - 118
VL  - 23
IS  - 2
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/MAIS_2016_23_2_a0/
LA  - ru
ID  - MAIS_2016_23_2_a0
ER  - 
%0 Journal Article
%A Yu. N. Artamonov
%T Building a data store with the dynamic structure
%J Modelirovanie i analiz informacionnyh sistem
%D 2016
%P 93-118
%V 23
%N 2
%I mathdoc
%U http://geodesic.mathdoc.fr/item/MAIS_2016_23_2_a0/
%G ru
%F MAIS_2016_23_2_a0
Yu. N. Artamonov. Building a data store with the dynamic structure. Modelirovanie i analiz informacionnyh sistem, Vol. 23 (2016) no. 2, pp. 93-118. http://geodesic.mathdoc.fr/item/MAIS_2016_23_2_a0/

[1] Barsegjan A. A., Tehnologii analiza dannyh: Data Mining, Visual Mining, Text Mining, OLAP, BHV-Peterburg, SPb, 2007, 384 pp. (in Russian)

[2] Dejt K. Dzh., Vvedenie v sistemy baz dannyh, Viljams, M., 2001 (in Russian)

[3] Martin J., Computer data-base organization, IBM Systems Research Institute, New Jersey, 1977, 665 pp. (in Russian)

[4] Konnolli T., Bazy dannyh: proektirovanie, realizacija i soprovozhdenie: Teorija i praktika, Viljams, M., 2003, 1440 pp. (in Russian)

[5] List Of NoSQL Databases, http://nosql-database.org/

[6] Marcos Kawazoe Aguilera, Carole Delporte-Gallet, Hugues Fauconnier, Sam Toueg, “Communication-efficient leader election and consensus with limited link synchrony”, Proceedings of the International Symposium on Principles of Distributed Computing (PODC) (2004), 328–337

[7] Herlihy M., Shavit N., “The topological structure of asynchronous computability”, Journal of the ACM, 46:6 (1999), 858–923

[8] Haifeng Yu, Amin Vahdat, “The costs and limits of availability for replicated services”, ACM Transactions on Computer Systems, 24:1 (2006), 70–113

[9] Brian F. Cooper, Raghu Ramakrishnan, Utkarsh Srivastava, Adam Silberstein, Philip Bohannon, Hans-Arno Jacobsen, Nick Puz, Daniel Weaver, Ramana Yerneni, “PNUTS: Yahoo!'s hosted data serving platform”, PVLDB, 1:2 (2008), 1277–1288

[10] Swati Ahirrao, Rajesh Ingle, “Scalable transactions in cloud data stores”, Journal of Cloud Computing: Advances, Systems and Applications, 4:21 (2015), 1–14

[11] In-memory data structure store Redis, http://redis.io/

[12] MongoDB Professional with Cloud Manager, https://www.mongodb.org/

[13] A Database for the Web CouchDB, http://couchdb.apache.org/

[14] Pisarenko D. S., Rublev V. S., “Object DBMS DIM and its main concepts”, Modeling and Analysis of Information Systems, 16:1 (2009), 62–91 (in Russian)

[15] Rublev V. S., “The object query language of the dynamic information model DIM”, Modeling and Analysis of Information Systems, 17:3 (2010), 144–161 (in Russian)

[16] Roublev V. S., “Evolution of DBMS DIM Database Schemes”, Modeling and Analysis of Information Systems, 19:2 (2012), 97–108 (in Russian)

[17] Antonov D. V., Roublev V. S., “Access Efficiency to Data in DIM DBMS”, Modeling and Analysis of Information Systems, 22:2 (2015), 158–175 (in Russian)

[18] Petrov A. N., Roublev V. S., “Completeness of the Dynamics of the Attributes Values of Data in the Database DIM”, Modeling and Analysis of Information Systems, 22:2 (2015), 259–277 (in Russian)

[19] Roublev V. S., “Static completeness of the dynamic information model”, Automatic control and computer sciences, 49:3 (2015), 167–176

[20] A Comprehensive Data Integration and Business Analytics Platform, http://www.pentaho.com/

[21] Data Mining Software in Java, http://www.cs.waikato.ac.nz/ml/weka/

[22] Doug Hoyte, Let Over Lambda, 2010, 384 pp.

[23] Biliris A., “An Efficient Database Storage Structure for Large Dynamic Objects”, Proceedings of the International Conference on Data Engineering (Phoenix, Arizona, 1992), 301–308

[24] Poltavtsev A. A., “Dynamic structures in relation databases”, Software Systems, 2:110 (2015), 95–97 (in Russian)

[25] Tsikritzis D., Lokhovski F., Modeli dannykh, Finansy i statistika, M., 1985, 168 pp. (in Russian)

[26] Kalinichenko L. A., Metody i sredstva integratsii neodnorodnykh baz dannykh, Nauka, M., 1983, 424 pp. (in Russian)