GENERATIVE AI FOR DATA STEWARDS: ENHANCING ACCURACY AND EFFICIENCY IN DATA GOVERNANCE

Authors

  • Ankush Reddy Sugureddy Lead Engineer, Data Insights, Cloudflare Inc, Dallas, USA. Author

Keywords:

Generative AI, Stewards, Accuracy, Efficiency, Data Governance

Abstract

The quality of data becomes an essential component for the success of an organisation in a world that is largely influenced by data, where data analytics is becoming increasingly popular in the process of informing strategic decisions. The failure to improve the quality of the data can lead to undesirable outcomes such as poor decisions, ineffective strategies, dysfunctional operations, lost commercial prospects, and abrasion of the consumer. In the process of organisations shifting their focus towards transformative methods such as generative artificial intelligence, several use cases may emerge that have the potential to aid the improvement of data quality. Streamlining procedures such as data classification, metadata management, and policy enforcement can be accomplished by the incorporation of generative artificial intelligence into data governance frameworks. This, in turn, reduces the workload of human data stewards and minimises the possibility of human error. In order to ensure compliance with legal standards such as the General Data Protection Regulation (GDPR) and the California Consumer Privacy Act (CCPA), generative artificial intelligence may analyse enormous datasets by utilising machine learning algorithms to discover patterns, inconsistencies, and compliance issues.

References

Pretrained Models — Sentence-Transformers documentation." https://www.sbert.net/do cs/pretrained_models.html (accessed.

Abduljawad, M., & Alsalmani, A. (2022). Towards creating exotic remote sensing datasets using image generating AI. In 2022 international conference on electrical and computing technologies and applications (ICECTA) (pp. 84–88). IEEE.

Acumen Research and Consulting. "Generative AI Market Size Will Achieve USD 110.8 Billion by 2030 growing at 34.3% CAGR - Exclusive Report by Acumen Research and Consulting." https://www.globenewswire.com/news-release/2022/12/14/2574140 /0/en/Generative-AI-Market-Size-Will-Achieve-USD-110-8-Billion-by-2030-growing -at-34-3-CAGR-Exclusive-Report-by-Acumen-Research-and-Consulting.html (accessed).

Adewumi, A. O., & Akinyelu, A. A. (2017). A survey of machine-learning and natureinspired based credit card fraud detection techniques. International Journal of System Assurance Engineering and Management, 8, 937–953.

Afzal, M., Li, R. Y. M., Ayyub, M. F., Shoaib, M., & Bilal, M. (2023). Towards BIM-based sustainable structural design optimization: A systematic review and industry perspective. Sustainability, 15(20), Article 15117.

Ali, S., DiPaola, D., & Breazeal, C. (2021). What are GANs?: Introducing generative adversarial networks to middle school students. Proceedings of the AAAI Conference on Artificial Intelligence, 35(17), 15472–15479.

Alsharhan, A., Al-Emran, M., & Shaalan, K. (2023). Chatbot adoption: A multiperspective systematic review and future research agenda (pp. 1–13). IEEE Transactions on Engineering Management. https://doi.org/10.1109/TEM.2023.3298360

Andrews, J. T., et al. (2023). Ethical considerations for collecting human-centric image datasets. arXiv preprint arXiv:2302.03629.

Bianchi, F., Terragni, S., & Hovy, D. (2021). Pre-training is a hot topic: Contextualized document embeddings improve topic coherence, 2021/08//. In ACL-IJCNLP 2021 (pp. 759–766). Association for Computational Linguistics. https://doi.org/ 10.18653/v1/2021.acl-short.96 [Online]. Available: https://aclanthology.or g/2021.acl-short.96.

Ce Zhou, Q. L., Chen, L., Yu, J., Liu, Y., Wang, G., Zhang, K., Cheng, J., Yan, Q., He, L., Peng, H., Li, J., Wu, J., Liu, Z., Xie, P., Xiong, C., Pei, J., Yu, P. S., & Sun, L. (2023). A comprehensive survey on pretrained foundation models: A history from BERT to ChatGPT. Available: https://dx.doi.org/10.48550/arxiv.2302.09419.

Chien-Chang Lin, A. Y. Q. H., & Yang, S. J. H. (2023). A review of AI-driven converstational chatbots implementation methodologies and challenges (1999- 2022). Sustainability, 15. https://doi.org/10.3390/su15054012

Duong, D., & Solomon, B. D. (2023). Analysis of large-language model versus human performance for genetics questions. medRxiv. https://doi.org/10.1101/ 2023.01.27.23285115 (in eng).

Dwivedi, Y. K., et al. (2023). Opinion paper: “So what if ChatGPT wrote it?” Multidisciplinary perspectives on opportunities, challenges and implications of generative conversational AI for research, practice and policy. International Journal of Information Management, 71, Article 102642. https://doi.org/10.1016/j. ijinfomgt.2023.102642

Ebert, C., & Louridas, P. (2023). Generative AI for software practitioners. IEEE Software, 40(4), 30–38. https://doi.org/10.1109/MS.2023.3265877

Eshraghian, J. K. (2020). Human ownership of artificial creativity. Nature Machine Intelligence, 2(3), 157–160. https://doi.org/10.1038/s42256-020-0161-x Ester, M., Kriegel, H.-P., Sander, J., & Xu, X. (1996). A density-based algorithm for discovering clusters in large spatial databases with noise (Vol. 96, pp. 226–231). and others.

Downloads

Published

2024-06-20

How to Cite

Ankush Reddy Sugureddy. (2024). GENERATIVE AI FOR DATA STEWARDS: ENHANCING ACCURACY AND EFFICIENCY IN DATA GOVERNANCE. INTERNATIONAL JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT (IJAIRD), 2(1), 189-202. https://iaeme-library.com/index.php/IJAIRD/article/view/IJAIRD_02_01_016