Software Testing and Quality Assurance for Data Intensive applications

(Tutorials)

Objective

The objectives of this tutorial are:

  • Building awareness about the current advancements in data intensive application and software testing strategies.
  • Highlighting the tools, techniques, and applications from the perspective of future intensive applications.
  • Providing a platform to participants for one-to-one interaction.

Target audience

The tutorial is intended to draw attention of the young minds and data professionals towards software testing and quality assurance strategies for data intensive applications with the help of existing tools and techniques.

Timeliness

Data intensive applications are one of the most critical aspects in real-time applications which is desired in most of the new-normal practices such as recommendation systems, social media analytics systems, fake news detection systems, etc. However, to deploy such solutions for real-time usage, software testing and quality assurance plays a vital role to understand the application behavior.

Motivation

With the advancements in technology various real-time applications are developed with big data analytics solutions. The efficient function of such data intensive solutions is very critical and hence it is necessary to perform desired software testing and get quality assurance before it is put into use. Following this motivation, this tutorial aims to unfold the various testing and quality assurance strategies along with the awareness about existing tools and techniques that can be utilized for various use cases.

Table of Contents

  1. Software Testing
    1. Software testing basics, dimensions, scope
    2. Level of Testing (Component, Integration, System)
    3. Software Testing Processes: Planning, Test-Case Design (Criteria-based, Human Knowledge-Based), Test Automation, Test Execution, Static Analysis, Test Evaluation (Oracle)
    4. Software testing types (Security, Performance, Conformance, Usability etc.) – Software testing artifacts (SQL Query, System Code, User Form etc.)
  2. Software Quality Assurance and Quality Control
    1. Definitions, quality of design and conformance, verifications, validations
    2. Audits, reviews, inspection and benchmarking
  3. Data Intensive Applications
    1. Characteristics: Reliability, maintainability, scalability, availability, concurrency, etc.
    2. Challenges: Management, integration, evolution, geographically distributed, context-aware, real time, etc.
  4. Tools for Data Intensive applications
    1. Hadoop, Spark, Flink, NoSQL, Hive, Zookeeper, Elastic Search, Flume, Kafka
    2. Testing Data Intensive applications
  5. Use cases of data intensive applications especially categorized into:
    1. Huge storage capacities to accommodate “data-at-rest” for store once and use it forever
    2. Capabilities to speed up the reads operations by using caches
    3. Allowing powerful search and filters
    4. Handling data in motion under stream processing
    5. Periodically exercising a large amount of accumulated data under batch processing

Speaker's Profile

Dr. Sonali Agarwal is working as an Associate Professor in the Information Technology Department of Indian Institute of Information Technology (IIIT), Allahabad, India. She received her Ph. D. Degree at IIIT Allahabad and joined as faculty at IIIT Allahabad, where she has been teaching since October 2009. She holds Bachelor of Engineering (B.E.) degree in Electrical Engineering from Bhilai Institute of Technology, Bhilai, (C.G.) India and Masters of Engineering (M.E.) degree in Computer Science from Motilal Nehru National Institute of Technology (MNNIT), Allahabad, India Her main research interests are in the areas of Artificial Intelligence and Big Data. She is the head of Big Data Analytics Lab at IIIT Allahabad, India.

Dr. Sanjay Kumar Sonbhadra is working as Teaching Research Fellow (TRF) in the Information Technology Department of Indian Institute of Information Technology (IIIT), Allahabad, India. His research is mainly working on One Class Classification, Anomaly detection, Dimensionality reduction techniques for target specific mining. He is a senior member of Big Data Analytics Lab at IIIT Allahabad, India. Sanjay has published many articles in the area of machine learning applications to address recent challenges of COVID-19.

Mr. Narinder Singh Punn is working as Senior Research Fellow in the Information Technology Department of Indian Institute of Information Technology (IIIT), Allahabad, India. Narinder’s main research includes Medical Imaging segmentation, Deep learning and Artificial Intelligence techniques in healthcare. He is a senior member of Big Data Analytics Lab at IIIT Allahabad, India. His recent publications cover applications of deep learning in detection and prevention of COVID-19.