Structuring a textile knitting dataset for machine learning and data mining applicationsMendeley Data

Knitting is a vital sector of the fabric manufacturing industry. Concurrently, machine learning is emerging as a highly regarded technique for predicting patterns and classifying various parameters derived from datasets. This study aims to establish a comprehensive database of knitted fabrics that e...

Full description

Saved in:

Bibliographic Details
Main Authors:	Toufique Ahmed, Abu Saleh Muhammad Junayed
Format:	Article
Language:	English
Published:	Elsevier 2025-08-01
Series:	Data in Brief
Subjects:	Data processing GSM Python Random forest Tightness factor
Online Access:	http://www.sciencedirect.com/science/article/pii/S2352340925005979
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1839582649684852736
author	Toufique Ahmed Abu Saleh Muhammad Junayed
author_facet	Toufique Ahmed Abu Saleh Muhammad Junayed
author_sort	Toufique Ahmed
collection	DOAJ
description	Knitting is a vital sector of the fabric manufacturing industry. Concurrently, machine learning is emerging as a highly regarded technique for predicting patterns and classifying various parameters derived from datasets. This study aims to establish a comprehensive database of knitted fabrics that encompasses a variety of parameters related to yarn types, machinery, and fabric characteristics. The raw data was collected from a knitting factory, after which the dataset was processed with various pre-processing techniques using domain knowledge and the Python programming. These techniques included data cleaning, normalization, and feature engineering, all of which were crucial in ensuring the quality and usability of the dataset. Drawing on expertise in knitting science, several new parameters were formulated, and specific complex parameters were subsequently deconstructed into two or three distinct components. The finalized dataset has 12569 rows and 38 columns. This article also discusses potential applications of the dataset, such as identifying a polynomial relationship between grams per square meter (GSM) and yarn count for single jersey fabrics, having an R² score of 0.77. Furthermore, a quadratic relationship between the tightness factor and stitch length was observed, with an R² score of 0.78. Among various machine learning models to predict GSM, Random Forest and XGBoost consistently outperformed across all metrics (R² score, Mean Absolute Error, and Mean Square Error).
format	Article
id	doaj-art-08ee1f66436b4de89e8985a8d2bc9909
institution	Matheson Library
issn	2352-3409
language	English
publishDate	2025-08-01
publisher	Elsevier
record_format	Article
series	Data in Brief
spelling	doaj-art-08ee1f66436b4de89e8985a8d2bc99092025-08-04T04:24:32ZengElsevierData in Brief2352-34092025-08-0161111873Structuring a textile knitting dataset for machine learning and data mining applicationsMendeley DataToufique Ahmed0Abu Saleh Muhammad Junayed1Dept. of Textile Engineering, Faculty of Engineering, Daffodil international University, Dhaka-1216, Bangladesh; Corresponding author.Knitting Section, Fakhruddin Textile Mills Limited, Gazipur, BangladeshKnitting is a vital sector of the fabric manufacturing industry. Concurrently, machine learning is emerging as a highly regarded technique for predicting patterns and classifying various parameters derived from datasets. This study aims to establish a comprehensive database of knitted fabrics that encompasses a variety of parameters related to yarn types, machinery, and fabric characteristics. The raw data was collected from a knitting factory, after which the dataset was processed with various pre-processing techniques using domain knowledge and the Python programming. These techniques included data cleaning, normalization, and feature engineering, all of which were crucial in ensuring the quality and usability of the dataset. Drawing on expertise in knitting science, several new parameters were formulated, and specific complex parameters were subsequently deconstructed into two or three distinct components. The finalized dataset has 12569 rows and 38 columns. This article also discusses potential applications of the dataset, such as identifying a polynomial relationship between grams per square meter (GSM) and yarn count for single jersey fabrics, having an R² score of 0.77. Furthermore, a quadratic relationship between the tightness factor and stitch length was observed, with an R² score of 0.78. Among various machine learning models to predict GSM, Random Forest and XGBoost consistently outperformed across all metrics (R² score, Mean Absolute Error, and Mean Square Error).http://www.sciencedirect.com/science/article/pii/S2352340925005979Data processingGSMPythonRandom forestTightness factor
spellingShingle	Toufique Ahmed Abu Saleh Muhammad Junayed Structuring a textile knitting dataset for machine learning and data mining applicationsMendeley Data Data in Brief Data processing GSM Python Random forest Tightness factor
title	Structuring a textile knitting dataset for machine learning and data mining applicationsMendeley Data
title_full	Structuring a textile knitting dataset for machine learning and data mining applicationsMendeley Data
title_fullStr	Structuring a textile knitting dataset for machine learning and data mining applicationsMendeley Data
title_full_unstemmed	Structuring a textile knitting dataset for machine learning and data mining applicationsMendeley Data
title_short	Structuring a textile knitting dataset for machine learning and data mining applicationsMendeley Data
title_sort	structuring a textile knitting dataset for machine learning and data mining applicationsmendeley data
topic	Data processing GSM Python Random forest Tightness factor
url	http://www.sciencedirect.com/science/article/pii/S2352340925005979
work_keys_str_mv	AT toufiqueahmed structuringatextileknittingdatasetformachinelearninganddataminingapplicationsmendeleydata AT abusalehmuhammadjunayed structuringatextileknittingdatasetformachinelearninganddataminingapplicationsmendeleydata

Structuring a textile knitting dataset for machine learning and data mining applicationsMendeley Data

Similar Items