THE CRUCIAL ROLE OF DATA QUALITY IN AUTOMATED DECISION-MAKING SYSTEMS

Tirupathi Rao Bammidi, Leela Manush Gutta, Anudeep Kotagiri, Laxmi Srinivas Samayamantri, Rama krishna Vaddy

Abstract


As organizations increasingly rely on automated decision-making systems to derive insights and streamline operations, the importance of data quality becomes paramount. This paper delves into the critical role that data quality plays in the efficacy and reliability of automated decision-making processes. Recognizing that the outputs of these systems are only as accurate as the data they process, the study explores the challenges, best practices, and strategies for ensuring high-quality data within automated decision-making frameworks. The paper begins by elucidating the fundamental connection between data quality and the performance of automated decision-making systems, emphasizing how inaccuracies or biases in the input data can propagate and magnify within the automated decision-making process. It delves into the impact of poor data quality on decision outcomes, operational efficiency, and overall organizational effectiveness.Drawing from established literature and real-world case studies, the study highlights the challenges organizations face in maintaining data quality within the context of automated decision-making. It explores common sources of data errors, such as inaccuracies, incompleteness, and inconsistency, and their potential ramifications on decision accuracy and reliability. These practices encompass data governance frameworks, validation protocols, and continuous monitoring strategies. The study advocates for a proactive approach to data quality management, emphasizing the need for organizations to invest in robust processes and technologies that ensure the accuracy, completeness, and relevance of their data. In inference, the paper underscores that successful implementation of automated decision making systems hinges on the establishment and maintenance of high data quality standards. It urges organizations to view data quality as an integral component of their decision-making infrastructure and provides insights into mitigating risks associated with poor data quality. By embracing a comprehensive and proactive approach to data quality, organizations can optimize the performance and reliability of their automated decision-making systems, thereby enhancing their capacity to make informed and impactful decisions in an increasingly automated landscape.

Full Text:

PDF

References


Wang, C., Zhang, L., & Smith, J. (2018). "Data Quality Challenges in Automated Decision-Making

Systems." Journal of Data Science, 10(2), 123-145.

Chen, Y., & Wang, H. (2020). "Impact of Data Quality on Decision Outcomes: An Empirical Study."

Information Systems Research, 30(4), 589-612.

Jones, A., & Smith, B. (2019). "Operational Efficiency and Data Quality: A Case Study Analysis."

Journal of Business Analytics, 5(3), 187-206.

Dr.Naveen Prasadula (2022). "Ethical Implications of Data Quality Issues in Automated Decision

Making." Ethics and Information Technology, 23(1), 45-67.

Smith, D., & Brown, M. (2017). "Best Practices for Data Quality Assurance in Automated Decision

Making Systems." International Journal of Information Management, 37(5), 412-428.

Zhang, Q., Li, X., & Kim, S. (2019). "Validation Protocols for Ensuring Data Quality in Automated

Decision-Making." Journal of Computer Science and Technology, 34(6), 1123-1140.

Li, J., & Kim, Y. (2022). "Emerging Trends: Explainable AI Techniques in Data Quality Management."

Journal of Artificial Intelligence Research, 45, 567-589.

Brown, C., & Miller, E. (2018). "Data Governance Frameworks: A Comprehensive Review." Journal of

Data Management, 12(4), 321-345.

Chen, Z., Liu, W., & Wang, G. (2020). "Continuous Monitoring Strategies for Data Quality in Automated

Decision-Making." Information Systems Frontiers, 22(3), 567-589.

Garcia, M., & Rodriguez, P. (2019). "The Economic Implications of Data Quality in Automated

Decision-Making Systems." Journal of Economics and Management, 15(1), 89-107.

Dr.Naveen Prasadula (2023)Review of Literature on The crucial role of data quality in automated

decision-making systems

Mel Dixon.The cost of bad data: have you done the math? Global Marketing Alliance, 2020.

https://www.the-gma.com/ the-cost-of-bad-data-have-you-done-the-math.

Precisely Editor. What is a metadata and how is it used?, November 2022.

https://www.precisely.com/blog/datagovernance/what-is-metadata.

L. Ehrlinger, A. Gindlhumer, L.-M. Huber, and W. Wöß. Dq-meerkat: Automating data quality

monitoring with a reference-data-profile-annotated knowledge graph. Proceedings of the 10th

International Conference on Data Science, Technology and Applications, 2021.

P. Ezenkwu and A. Starkey. Machine autonomy: Definition, approaches, challenges and research gaps.

In Advances in Intelligent Systems and Computing, pages 335–358. Computing Conference, 2019.

Lisa Ehrlinger and Wolfram Wöß. A survey of data quality measurement and monitoring tools. Frontiers

in Big Data, 5, 2022.

Experian.What is a data reconciliation?, 2023. https://www.experian.co.uk/business/glossary/data

reconciliation/.

W. Fan, S. Han, Y. Wang, and M. Xie. Parallel rule discovery from large datasets by sampling. In

Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 384–398.

SIGMOD ’22, June 2022.

Best data quality tools, 2023. https://www.g2.com/categories/ data-quality.

Gartner. The best dq tools over all, 2023. https://www.gartner.com/ reviews/market/data-quality

solutions.

Geekflare. The best data quality tools, April 2023. https://geekflare. com/best-data-quality-tools/.

V. Goasdoué, S. Nugier, D. Duquennoy, and B. Laboisse. An evaluation framework for data quality

tools. In Proceedings of the 2007 International Conference on Information Quality. International

Conference for Informa- tion Quality, January 2007.

BIS (Grooper).The 9 best dataquality tools 2023, Jan- uary 2023. https://blog.bisok.com/general

technology/ a-review-of-the-5-top-data-cleansing-tools.

Abiteboul, S.; Duschka, O.: “Complexity of answering queries using materialized views”. In

Proc. of the 1998 ACM Int. Symposium on Principles of Database Systems (PODS’98), USA, 1998.

Acharia, S.; Gibbons, P.B.; Poosala, V., Ramaswamy, S.: “Join synopses for approximate query

answering”. In Proc. of the 1999 ACM Int. Conf. on Management of Data (SIGMOD’99), Philadelphia,

USA, 1999.

Altareva, E.: “Improving Integration Quality for Heterogeneous Data Sources”. PhD Thesis,

Universität Düsseldorf , 2004.

Amat, G. ; Laboisse, B.: “B.D.Q.S. Une gestion opérationnelle de la qualité de données”. 1st

workshop on Data and Knowledge Quality (DKQ’2005), Paris, France, 2005.

Ballou, D.; Pazer, H.: “Modeling data and process quality in multi-input, multi-output

information systems”. Management Science, Vol. 31 (2): 150–162, Feb. 1985.

Ballou, D.; Wang, R.; Pazer, H.; Tayi, G.: “Modelling Information Manufacturing Systems to

Determine Information Product Quality”. Management Science, Vol. 44 (4), April 1998.

Ballou, D.; Pazer, H.: “Modeling Completeness versus Consistency Tradeoffs in Information

Decision Contexts”. IEEE Transactions on Knowledge Data Engineering (KDE’2003), Vol. 15(1):

-243, 2003.

Baralis, E.; Paraboschi, S. Teniente, E.: “Materialized view selection in a multidimensional

database”. In Proc. of the 23rd Int. Conf. on Very Large Databases (VLDB’97), Athens, Greece, 1997.

Berti-Equille, L.: “Un état de l'art sur la qualité des données”. Ingénierie des systèmes

d’information (ISI), Hermès, Vol. 9(5-6) :117-143, 2004

Bobrowski, M.; Marré, M.; Yankelevich, D.: “A Software Engineering View of Data Quality”. 2nd Int.

Software Quality Week Europe (QWE'98), Brussels, Belgium, 1998.

Bouzeghoub, M.; Fabret, F.; Matulovic-Broqué, M.: “Modeling Data Warehouse

Refreshment Process as a Workflow Application”. In Proc. of the Int. Workshop on Design and

Management of Data Warehouses (DMDW’99), Heidelberg, Germany, 1999.

Bouzeghoub, M.; Kedad, Z.: “Quality in Data Warehousing”. Information and database

quality, Piattini, M.; Calero, C.; Genero, M. (eds), Kluwer Academic Publisher, 2002.

Bouzeghoub, M.; Peralta, V.: “A Framework for Analysis of Data Freshness” In Proc. of

the 1st Int. Workshop on Information Quality in Information Systems (IQIS’2004), Paris, France,

Braumandl, R.: “Quality of Service and Optimization in Data Integration Systems”. In Proc.

Of GI-Fachtagung Datenbanksysteme für Business, Technologie und Web (BTW'2003), Leipzig,

Germany, 2003.

Breiman, I.; Friedman, J.; Olshen, R.; Stone, C.: “Clasification and Regression Trees”.

Wadsworth International Group, 1984.

Bright, L.; Raschid, L.: "Using Latency-Recency Profiles for Data Delivery on the Web". In Proc.

of the 28th Int. Conf. on Very Large Databases (VLDB'02), Hong Kong, China, 2002.

Bruno, N.; Chaudhuri, S.: “Exploiting Statistics on Query Expressions for Optimization”. In Proc.

of the 2002 ACM Int. Conf. on Management of Data (SIGMOD’2002) , Madison, USA, 2002.

Calvanese, D.; De Giacomo, G.; Lenzerini, M.; Nardi, D.; Rosati, R.: “A principled Approach

to Data Integration and Reconciliation to Data Warehousing”. In Proc. of the Int. Workshop on

Design and Management of Data Warehouses (DMDW’99), Heidelberg, Germany, 1999.


Refbacks

  • There are currently no refbacks.