Polish Information Processing Society

Annals of Computer Science and Information Systems, Volume 8

Proceedings of the 2016 Federated Conference on Computer Science and Information Systems

The Column-oriented Data Store Performance Considerations

DOI: http://dx.doi.org/10.15439/2016F486

Citation: Proceedings of the 2016 Federated Conference on Computer Science and Information Systems, M. Ganzha, L. Maciaszek, M. Paprzycki (eds.). ACSIS, Vol. 8, pages 877–881 (2016)

Abstract. The massive amounts of data processed by information systems make detailed analysis of database performance increasingly important. Column-oriented data stores are becoming increasingly popular in big data appliances. This paper identifies the factors that affect database performance, based on empirical studies of a custom implementation. To summarize the research, a simple mathematical performance model has been created.
