Data management helps in optimizing the use of data so that decisions made and actions taken lead to maximum benefit. A robust data management system is very important to on-line system, big data, and critical infrastructures. Without data management, it will lead to incompatible data sets, inconsistent data sets, data quality problems, etc. that will delay projects making integrating very tedious.
Using some of Wikipedia [link], some of the major topics we picked and modified in data management article include:
In big data management, data is generally store and process in a data lake or warehouse preferably using object storage for its efficiency, security and reliability.
Data Warehouse
Data warehouses are typical relational database that store structured data from various sources. The mini version of it are the data marts that houses smaller version of data warehouses for local usage.
Data Lake
Data lakes are huge bulk of data that typically store raw data in NoSQL databases and object storage.
Extracted from Oracle [link]:
Examples:
Notable points to note in cloud databases are
ETL is the short form of Extract, Transform, and Load in data lakes processes. Some of these processes run at scheduled intervals for batch processing, while other smaller processes run in real time.
Examples:
Examples:
Huberman, A. M., & Miles, M. B. (1994). Data management and analysis methods. In N. K. Denzin & Y. S. Lincoln (Eds.), Handbook of qualitative research (p. 428–444). Sage Publications, Inc.
Acharya S., Alonso R., Franklin M., Zdonik S. (1995). Broadcast Disks: Data Management for Asymmetric Communication Environments. In Mobile Computing. The Kluwer International Series in Engineering and Computer Science, vol 353. Springer, Boston, MA.
Clifford Lynch , "How do your data grow?," Nature, vol 455, no. 7209, pp. 28–29, Feb. 2008. DOI: 10.1038/455028a
Sam Madden, "From Databases to Big Data," IEEE Internet Computing, vol. 16, no. 3, pp. 4-6, Apr. 2012. DOI: 10.1109/MIC.2012.50
Jinchuan Chen, Yueguo Chen, Xiaoyong Du, Cuiping Li, Jiaheng Lu, Suyun Zhao, and Xuan Zhou, "Big data challenge: a data management perspective," Frontiers of Computer Science, vol. 7, no. 2, pp. 157–164, Apr. 2013. DOI: 10.1007/s11704-013-3903-7
Note: Limited read page count on external host website
Data science strategy, information management, knowledge management, I2R data management, automation, research, science, Singapore
#data #management #information #knowledge #NTU #singapore #limjunlong