ML-Assisted Pattern Recognition for UTS Estimation in FDM PLA Specimens
Research on applying supervised ML algorithms (Logistic, Gradient Boosting, Decision Tree, KNN) to predict Ultimate Tensile Strength of FDM-printed PLA, with KNN showing superior performance.
1. Introduction
Artificial Intelligence (AI) and Machine Learning (ML) are revolutionizing manufacturing, offering unprecedented capabilities for process optimization and predictive analytics. In Additive Manufacturing (AM), particularly Fused Deposition Modeling (FDM), controlling mechanical properties like Ultimate Tensile Strength (UTS) is critical for functional part reliability. This study pioneers the application of supervised ML classification algorithms to estimate the UTS of FDM-fabricated Polylactic Acid (PLA) specimens based on key printing parameters.
The research addresses a significant gap: moving from empirical, trial-and-error parameter tuning to data-driven, predictive modeling for mechanical property estimation. By correlating input parameters (Infill Percentage, Layer Height, Print Speed, Extrusion Temperature) with output UTS classes, the work lays groundwork for intelligent, closed-loop AM systems.
2. Methodology
2.1. Specimen Fabrication & Parameters
A dataset was generated from 31 PLA specimens fabricated via FDM. Four key process parameters were varied to create the feature set for the ML models:
Infill Percentage: Density of the internal structure.
Layer Height: Thickness of each deposited layer.
Print Speed: Nozzle travel speed during deposition.
Extrusion Temperature: Temperature of the molten filament.
The UTS of each specimen was measured experimentally and then categorized into classes (e.g., "High" or "Low" UTS) to formulate a supervised classification problem.
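As a hedged sketch of this framing (the paper's raw measurements and class cutoff are not reproduced here; the column names, values, and 45 MPa threshold below are illustrative assumptions), the labeling step might look like:

```python
import pandas as pd

# Hypothetical excerpt of the 31-specimen dataset; values and column names
# are illustrative assumptions, not taken from the paper.
data = pd.DataFrame({
    "infill_pct":     [20, 50, 80, 100],        # internal structure density (%)
    "layer_height":   [0.10, 0.20, 0.20, 0.30], # deposited layer thickness (mm)
    "print_speed":    [40, 60, 60, 80],         # nozzle travel speed (mm/s)
    "extrusion_temp": [200, 210, 215, 220],     # molten filament temperature (C)
    "uts_mpa":        [32.5, 41.0, 48.2, 51.7], # measured UTS (MPa)
})

# Binarize the continuous UTS measurement into "High"/"Low" classes to pose
# a supervised classification problem (the 45 MPa cutoff is an assumption).
UTS_THRESHOLD_MPA = 45.0
data["uts_class"] = data["uts_mpa"].apply(
    lambda u: "High" if u >= UTS_THRESHOLD_MPA else "Low")

X = data[["infill_pct", "layer_height", "print_speed", "extrusion_temp"]]
y = data["uts_class"]
```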
2.2. Machine Learning Algorithms
Four distinct supervised classification algorithms were implemented and compared:
Logistic Classification: A linear model for binary classification.
Gradient Boosting Classification: An ensemble technique that builds sequential trees to correct errors.
Decision Tree: A non-parametric model that splits data based on feature values.
K-Nearest Neighbor (KNN): An instance-based learning algorithm that classifies a point based on the majority class of its 'k' nearest neighbors in the feature space.
Model performance was evaluated using metrics like F1 Score and Area Under the Curve (AUC) of the Receiver Operating Characteristic (ROC).
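A minimal comparison sketch under stated assumptions: scikit-learn implementations, 5-fold cross-validation, and the `X`, `y` objects from the labeling example above standing in for the full 31-specimen dataset. The paper's exact splits and hyperparameters are not specified, so this is illustrative rather than a reproduction:

```python
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeClassifier

models = {
    "Logistic":          LogisticRegression(max_iter=1000),
    "Gradient Boosting": GradientBoostingClassifier(random_state=0),
    "Decision Tree":     DecisionTreeClassifier(random_state=0),
    # KNN is distance-based, so features are standardized; the pipeline
    # keeps the scaler inside each fold to avoid data leakage.
    "KNN":               make_pipeline(StandardScaler(),
                                       KNeighborsClassifier(n_neighbors=5)),
}

# cv=5 presumes the full 31-specimen dataset, not the 4-row excerpt above.
for name, model in models.items():
    f1 = cross_val_score(model, X, y, cv=5, scoring="f1_macro").mean()
    auc = cross_val_score(model, X, y, cv=5, scoring="roc_auc").mean()
    print(f"{name:18s} F1={f1:.2f} AUC={auc:.2f}")
```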
3. Results & Discussion
3.1. Algorithm Performance Comparison
The experimental results provided a clear hierarchy of model effectiveness for this specific task:
Algorithm Performance Summary
K-Nearest Neighbor (KNN): F1 Score = 0.71, AUC = 0.79
Decision Tree: F1 Score = 0.71, AUC below KNN's 0.79
Logistic Classification & Gradient Boosting: Lower F1 and AUC than both KNN and the Decision Tree (specific scores are implied rather than restated here).
While the Decision Tree matched KNN's F1 score, the AUC metric revealed KNN's superior ability to distinguish between UTS classes across all classification thresholds.
3.2. K-Nearest Neighbor Superiority
The KNN algorithm emerged as the most favorable model. Its success can be attributed to the nature of the dataset and problem:
Local Similarity: UTS is likely determined by complex, non-linear interactions between parameters. KNN's local approximation captures these patterns without assuming a global functional form, unlike linear models (Logistic Regression).
Robustness to Small Datasets: With only 31 data points, simpler, non-parametric models like KNN and Decision Trees are less prone to overfitting compared to complex ensemble methods like Gradient Boosting, which may require more data to generalize effectively.
Interpretability vs. Performance: While a Decision Tree offers clear rule-based interpretation, its performance (AUC) was slightly inferior to KNN, suggesting KNN's distance-based reasoning was more aligned with the underlying data geometry for this property prediction task.
Implied visualization: a bar chart of F1 scores would show KNN and the Decision Tree tied at 0.71, while a companion chart or table of AUC values would highlight the key differentiator, with KNN's bar clearly highest at 0.79, demonstrating its superior discriminative power.
4. Technical Analysis & Framework
4.1. Mathematical Formulation
The core of the KNN algorithm for classification can be formalized. Given a new input feature vector $\mathbf{x}_{\text{new}}$ (comprising infill %, layer height, etc.), its class $C$ is determined by:
Distance Calculation: Compute the distance (e.g., Euclidean) between $\mathbf{x}_{\text{new}}$ and all training vectors $\mathbf{x}_i$ in the dataset:
$$d(\mathbf{x}_{\text{new}}, \mathbf{x}_i) = \sqrt{\sum_{j=1}^{4} \left(x_{\text{new},j} - x_{i,j}\right)^2}$$
Majority Vote: Identify the set $\mathcal{N}_k$ of the $k$ training points with the smallest distances and assign the most frequent class among them:
$$C = \underset{c}{\arg\max} \sum_{i \in \mathcal{N}_k} I(C_i = c)$$
where $I(\cdot)$ is the indicator function, and $C_i$ is the class of the $i$-th neighbor.
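A direct NumPy transcription of these two steps, offered as a from-scratch sketch rather than the paper's implementation (`k` and the input arrays are assumed):

```python
import numpy as np

def knn_predict(x_new, X_train, y_train, k=5):
    """Classify x_new by majority vote among its k nearest training points."""
    # Distance step: Euclidean distance from x_new to every training vector.
    # Features should be standardized beforehand so no single parameter
    # (e.g., temperature, in the hundreds) dominates the metric.
    distances = np.sqrt(((X_train - x_new) ** 2).sum(axis=1))
    # Neighborhood N_k: indices of the k smallest distances.
    neighbors = np.argsort(distances)[:k]
    # Majority vote: the class occurring most often among those neighbors.
    classes, counts = np.unique(y_train[neighbors], return_counts=True)
    return classes[np.argmax(counts)]

# Example query (illustrative values): infill %, layer height, speed, temp.
# knn_predict(np.array([80, 0.2, 60, 210]), X_train, y_train, k=5)
```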
The AUC metric, where KNN excelled, represents the probability that the model ranks a random positive instance higher than a random negative instance. An AUC of 0.79 indicates a 79% chance of correct ranking, signifying good discriminative ability.
4.2. Analysis Framework Example
Scenario: An engineer wants to predict if a new set of FDM parameters will yield "High" or "Low" UTS without printing.
Framework Application (Non-Code):
Data Representation: The new parameter set {Infill: 80%, Layer Height: 0.2mm, Speed: 60mm/s, Temp: 210°C} is formatted as a feature vector.
Model Query: This vector is fed into the trained KNN model ($k=5$, using Euclidean distance, standardized features).
Neighborhood Analysis: The model calculates distances to all 31 historical prints. It finds the 5 most similar past prints based on parameter proximity.
Decision & Confidence: If 4 of those 5 similar past prints had "High" UTS, the model predicts "High" for the new set. The proportion (4/5 = 80%) acts as a confidence score. The AUC score of 0.79 gives an overall trust in the model's ranking ability across all possible thresholds.
Action: The engineer uses this prediction to approve the parameters for a critical part or decide to adjust them before a costly print.
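The same walkthrough, expressed as a hedged scikit-learn sketch; the pipeline, `X`, and `y` are carried over from the earlier examples and are assumptions, not the authors' code:

```python
import pandas as pd
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Assumed: X, y hold the 31 historical prints (features and "High"/"Low" labels).
knn = make_pipeline(StandardScaler(), KNeighborsClassifier(n_neighbors=5))
knn.fit(X, y)

# Step 1: format the new parameter set as a feature vector
# (infill %, layer height mm, speed mm/s, temperature C).
x_new = pd.DataFrame([[80, 0.2, 60, 210]], columns=X.columns)

# Steps 2-4: the pipeline standardizes the vector, finds the 5 nearest
# historical prints, and reports the neighbor vote shares as probabilities.
prediction = knn.predict(x_new)[0]
confidence = knn.predict_proba(x_new).max()
print(f"Predicted UTS class: {prediction} (neighbor vote share: {confidence:.0%})")
```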
5. Future Applications & Directions
The findings of this study open several promising avenues for research and industrial application:
Multi-Property Prediction: Extending the framework to simultaneously predict a suite of mechanical properties (flexural strength, impact toughness, fatigue life) from the same set of printing parameters, creating a comprehensive "digital material datasheet" for FDM processes.
Integration with Generative AI & Inverse Design: Coupling the predictive ML model with generative algorithms or optimization techniques (like those explored in CycleGAN for image translation or topology optimization software) to solve the inverse problem: automatically generating optimal printing parameters to achieve a user-specified target UTS or property profile. A brute-force sketch of such an inverse search appears after this list.
Real-Time Process Control: Implementing the lightweight KNN model (or an optimized successor) within the printer's firmware or a connected edge computing device. It could analyze in-situ sensor data (e.g., nozzle temperature variance, layer adhesion sound) alongside planned parameters to predict final part strength and trigger adjustments mid-print, moving towards zero-defect manufacturing.
Material-Agnostic Models: Expanding the dataset to include other common FDM materials (ABS, PETG, composites). Research could explore transfer learning techniques, where a model pre-trained on PLA data is fine-tuned with smaller datasets for new materials, accelerating the development of smart printing systems for diverse material libraries.
Standardized Benchmarking: Creating open, large-scale benchmark datasets for AM process-property relationships, similar to ImageNet in computer vision. This would accelerate community-wide ML model development and validation, a direction strongly advocated by institutions like NIST (National Institute of Standards and Technology) through efforts such as its AM Bench benchmark series.
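As referenced above, a brute-force sketch of the inverse-design idea, assuming the fitted `knn` pipeline from the framework example; the grid ranges are illustrative assumptions, and a real system would use a proper optimizer rather than exhaustive enumeration:

```python
from itertools import product
import pandas as pd

# Coarse grid over the four process parameters (ranges are assumptions).
grid = product(
    [20, 40, 60, 80, 100],   # infill %
    [0.1, 0.2, 0.3],         # layer height (mm)
    [40, 60, 80],            # print speed (mm/s)
    [200, 210, 220],         # extrusion temperature (C)
)
cand_df = pd.DataFrame(list(grid), columns=X.columns)

# Probability of the "High" UTS class for every candidate setting.
high_idx = list(knn.classes_).index("High")
p_high = knn.predict_proba(cand_df)[:, high_idx]

# The top-ranked rows are parameter sets the model expects to print strong.
top = cand_df.assign(p_high=p_high).nlargest(5, "p_high")
print(top)
```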
6. References
Mishra, A., & Jatti, V. S. (Year). Machine Learning-Assisted Pattern Recognition Algorithms for Estimating Ultimate Tensile Strength in Fused Deposition Modeled Polylactic Acid Specimens. Journal Name, Volume(Issue), pages. (Source PDF)
Du, B., et al. (Year). Void formation in friction stir welding: A decision tree and Bayesian neural network analysis. Welding Journal.
Hartl, R., et al. (Year). Application of Artificial Neural Networks for weld surface quality prediction in friction stir welding. Journal of Materials Processing Technology.
Du, Y., et al. (2021). Physics-informed machine learning for additive manufacturing defect prediction. Nature Communications, 12, 5472.
Maleki, E., et al. (Year). Machine learning analysis of post-treatment effects on fatigue life of AM samples. International Journal of Fatigue.
Zhu, J., Park, T., Isola, P., & Efros, A. A. (2017). Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks. IEEE International Conference on Computer Vision (ICCV). (External reference for generative methods).
National Institute of Standards and Technology (NIST). (n.d.). Additive Manufacturing Metrology Testbed (AMMT) and Data. Retrieved from https://www.nist.gov/ (External reference for benchmarking).
7. Original Analyst Commentary
Core Insight
This paper isn't just about KNN beating a Decision Tree by 0.08 AUC points. It's a stark, early-stage validation that simple, instance-based learning can outperform more sophisticated "black-box" ensembles in the data-scarce, high-dimensional reality of additive manufacturing process-property mapping. The authors have inadvertently highlighted a critical rule for Industry 4.0: in nascent digital twin applications, sometimes the most interpretable and computationally cheap model is the most robust. The real insight is that the local geometry of the FDM parameter space (captured by KNN's distance metric) is a more reliable predictor of UTS than globally learned rules (Decision Trees) or complex functional approximations (Gradient Boosting), at least with n=31.
Logical Flow
The study's logic is sound but reveals its pilot-scale nature. It follows the classic ML pipeline: problem framing (classification of UTS), feature engineering (four key FDM parameters), model selection (a sensible mix of linear, tree-based, and instance-based classifiers), and evaluation (using both precision/recall balance via F1 and ranking ability via AUC). The logical leap to declaring KNN "most favorable" is supported by the AUC metric, which is indeed more robust for imbalanced datasets or when overall ranking performance is key, a nuance often missed in applied papers. However, the flow stumbles by not rigorously addressing the elephant in the room: the tiny dataset size. There is no mention of cross-validation strategies or train/test splits to mitigate overfitting risk, a significant methodological flaw for a claim of generalizable superiority.
Strengths & Flaws
Strengths: The paper's primary strength is its pioneering focus on ML for FDM PLA UTS estimation. Choosing a practical, industrially relevant problem is commendable. The use of AUC as a tie-breaker between identical F1 scores shows methodological maturity beyond basic accuracy reporting. It provides a clear, replicable benchmark for future work.
Critical Flaws: The sample size of 31 is perilously small for making definitive claims about algorithm superiority. The performance differences, while interesting, could be artifacts of a specific data split. The work lacks a feature importance analysis (e.g., from the Decision Tree or a permutation test). Which parameter—Infill % or Extrusion Temperature—drives the prediction most? This is a missed opportunity for fundamental process insight. Furthermore, the comparison feels incomplete without a simple baseline model (e.g., a dummy classifier or a linear regression thresholded for classification) to contextualize the reported scores. Is an F1 of 0.71 good? Without a baseline, it's hard to gauge the true value added by ML.
Actionable Insights
For researchers and practitioners:
Start with KNN for AM Property Prediction: Before deploying complex neural networks (as seen in computer vision for style transfer like CycleGAN), use KNN as a strong, interpretable baseline. Its success here aligns with findings from platforms like Kaggle where KNN often excels in small-to-medium tabular data competitions.
Invest in Data, Not Just Algorithms: The limiting factor is data, not model complexity. The next critical step is not testing more algorithms but systematically building a large, open-source dataset of FDM prints with measured properties, following the blueprint of materials informatics initiatives.
Focus on Uncertainty Quantification: For industrial adoption, a prediction must come with a confidence interval. Future work must integrate methods like Bayesian KNN or conformal prediction to tell the user not just "High UTS," but "High UTS with 85% confidence," which is crucial for risk assessment in aerospace or medical applications. A minimal conformal sketch follows this list.
Pursue Hybrid, Physics-Informed Models: The ultimate solution lies in hybrid models that embed known physical constraints (e.g., higher infill generally increases strength) into the ML framework, as pioneered by Du et al. in Nature Communications. This combines data-driven pattern recognition with domain knowledge, creating more robust and generalizable models that can extrapolate beyond the training data's parameter ranges.
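For concreteness, a minimal split-conformal sketch (assuming the fitted `knn` pipeline, `X`, `y`, and `x_new` from earlier; the finite-sample quantile correction is omitted, and n=31 is far too small for reliable calibration, so this is purely illustrative):

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Hold out a calibration split (illustrative only at this dataset size).
X_fit, X_cal, y_fit, y_cal = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0)
knn.fit(X_fit, y_fit)

# Nonconformity score: 1 minus the probability assigned to the true class.
cal_proba = knn.predict_proba(X_cal)
class_index = {c: i for i, c in enumerate(knn.classes_)}
true_idx = np.array([class_index[c] for c in y_cal])
cal_scores = 1.0 - cal_proba[np.arange(len(y_cal)), true_idx]

# Threshold targeting ~90% coverage (finite-sample correction omitted).
alpha = 0.10
q_hat = np.quantile(cal_scores, 1 - alpha)

# Prediction set: every class whose nonconformity is within the threshold.
proba_new = knn.predict_proba(x_new)[0]
prediction_set = [c for c, p in zip(knn.classes_, proba_new) if 1.0 - p <= q_hat]
print(prediction_set)  # ["High"] when confident; ["High", "Low"] when uncertain
```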
In conclusion, this paper is a valuable proof-of-concept that correctly identifies a promising algorithmic direction (KNN) but should be treated as the starting pistol for a much larger race towards data-centric, reliable, and actionable ML for additive manufacturing.