The International Arab Journal of Information Technology (IAJIT)

..............................
..............................
..............................


A Real-Time Business Analysis Framework Using Virtual Data Warehouse

Data Warehouse (DW) is widely used in industries over decades to perform the analysis on data to expedite decision-making process. However, the traditional DW is slower in execution due to the huge time overhead of pre-processing stages of Extraction-Transformation-Loading (ETL). On the other hand, often the situations arise where the decision-making are required in real time. Data virtualization is one of the robust approaches over traditional data warehouse that avoids the costly steps of ETL processing. Virtual Data Warehouse (VDW) allows specific analysis for quick decision making even on the unprocessed data. Moreover, VDW could be used by the organizations that maintain DW to take some immediate business decisions for some abrupt changes. This research work performs business trend specific analysis based on VDW to generate business intelligence even in the catastrophic situations. Experimental results reveal, the proposed methodology based on VDW is around thousand times faster than traditional warehouses.


[1] Aggarwal V., Gupta V., Singh P., Sharma K., and Sharma N., “Detection of Spatial Outlier by Using Improved Z-Score Test,” in Proceedings of International Conference on Trends in Electronics and Informatics, Tirunelveli, pp. 788- 790, 2019.

[2] Amazon Review Data. Link: https://nijianmo.github.io/amazon/index.html., Last Visited, 2020.

[3] Asadullaev S., Data Warehouse Architectures and Development Strategy, IBM Companion Guidebook, Open Group, 2015.

[4] Azvine B., Cui Z., Nauck D., and Majeed B., “Real Time Business Intelligence for the Adaptive Enterprise,” in Proceedings of 8th IEEE International Conference on E-Commerce Technology and The 3rd IEEE International Conference on Enterprise Computing, E- Commerce, and E-Services, San Francisco, pp. 29-29, 2006.

[5] Chandramouly A., Patil N., Ramamurthy R., Krishnan S., and Story J., Integrating Data Warehouses with Data Virtualization for BI Agility, Intel White Paper, 2013.

[6] Conn S., “OLTP and OLAP Data Integration: A Review of Feasible Implementation Methods and Architectures for Real Time Data Analysis,” in Proceedings of IEEE SoutheastCon, Ft. Lauderdale, pp. 515-520, 2005.

[7] Dahmani D., Rahal S., and Belalem G., “A New Approach to Improve Association Rules for Big Data in Cloud Environment,” The International Arab Journal of Information Technology, vol. 16, no. 6, pp. 1013-1020, 2019.

[8] Davis R. and Eve R., Data Virtualization: Going Beyond Traditional Data Integration to Achieve Business Agility, Nine Five One Press, United States, 2011.

[9] Ghosh P., Sadhu D., Sen S., and Debnath N., “Service Modelling for Virtual Data Warehouse,” in Proceedings of International Conference on Computer Applications in Industry and Engineering, San Diego, 2017.

[10] Ghosh P., Som S., and Sen S., “Business Intelligence Development by Analysing Customer Sentiment,” in Proceedings of IEEE International Conference on Reliability, Infocom Technologies and Optimization, Noida, pp. 287- 290, 2018.

[11] Goss R. and Veeramuthu K., “Heading Towards Big Data Building A Better Data Warehouse For More Data, More Speed, And More Users,” in Proceedings of Advanced Semiconductor Manufacturing Conference, Saratoga, pp. 220- 225, 2013.

[12] Guo S., Yuan Z., Sun A., and Yue Q., “A new ETL Approach Based On Data Virtualization,” Journal of Computer Science and Technology, vol. 30, pp. 311-323, 2015.

[13] Gupta A. and Sahayadhas A., “Proposed Techniques to Optimize the DW and ETL Query for Enhancing Data Warehouse Efficiency,” in Proceedings of International Conference on Computing, Communication and Security, Patna, pp. 1-5, 2020.

[14] Katkar V., Gangopadhyay S., Rathod S., and Shetty A., “Sales Forecasting Using Data Warehouse and Naïve Bayesian Classifier,” in Proceedings of International Conference on Pervasive Computing, Pune, pp. 1-6, 2015.

[15] Kholod I., Efimova M., and Kulikov S., “Using ETL Tools for Developing a Virtual Data Warehouse,” in Proceedings of IEEE International Conference on Soft Computing and A Real-Time Business Analysis Framework Using Virtual Data Warehouse 595 Measurements, St. Petersburg, pp. 351-354, 2016.

[16] Lans R., Data Virtualization for Business Intelligence Systems: Revolutionizing Data Integration for Data Warehouses, Morgan Kaufmann, 2012.

[17] Mousa A., Shiratuddin N., and Bakar M., “Virtual Data Mart for Measuring Organizational Achievement Using Data Virtualization Technique (KPIVDM),” Jurnal Teknologi, vol. 68, no. 3, pp. 67-70, 2014.

[18] Ni J., Li J., and McAuley J., “Justifying Recommendations using Distantly-Labeled Reviews and Fine-Grained Aspects,” in Proceedings of the Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, Hong Kong, pp. 188-197, 2019.

[19] Sahay B. and Ranjan J., “Real Time Business Intelligence in Supply Chain Analytics,” Information Management and Computer Security, vol. 16, no. 1, pp. 28-48, 2008.

[20] Salem R., Salesh S., and AbdulKader H., “Intelligent Replication for Distributed Active Real-Time Databases Systems,” The International Arab Journal of Information Technology, vol. 15, no. 3, pp. 505-519, 2018.

[21] Shukla A., Kumar A., and Singh H., “ANN Based Execution Time Prediction Model and Assessment of Input Parameters through ISM,” The International Arab Journal of Information Technology, vol. 17, no. 5, pp. 683-691, 2020. Partha Ghosh is pursuing his Ph.D. from University of Calcutta, Kolkata, India. Also, he is working as an Assistant Professor in the Computer Applications Department at B. P. Poddar Institute of Management & Technology, Saltlake Campus, Kolkata, India. His research areas are Data Warehouse, Big data, Machine Learning and Business intelligence. Deep Sadhu is working as a Data Scientist at EY (Ernst & Young), dealing with Machine Learning, Deep Learning & Big Data techniques. Soumya Sen is an Assistant professor in A. K. Choudhury School of Information Technology under University of Calcutta, Kolkata, India. His research areas are Data warehouse & OLAP tools, Distributed database, Big data, Machine learning.