Modelling

Module for modelling materials test data.

This module parameterizes mechanical test data by fitting constitutive models to it. It provides classes for fitting mathematical models to materials test data and for predicting material behavior from the fitted parameters. The module relies on the plug module for data handling.

Classes

ModelSet: Acts as a model DataSet, used to fit, collect, and predict using constitutive models.

`ModelSet`

Class that acts as a model DataSet, providing functionalities to fit, collect, and predict constitutive models for material behavior.

This class is designed to fit mathematical models to mechanical test data and make predictions based on the fitted parameters. It integrates with the DataSet and DataItem classes for handling data.

Attributes:

Name	Type	Description
`model_func`	`Callable`	A function defining the mathematical model to be fitted. It should accept an array of x-values and a tuple of variables and parameters, and return an array of y-values.
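`variable_names`	`List[str]`	List of variable names that may be used in the model_func. Each name is looked up in a DataItem's info, and the values are passed to model_func ahead of the parameters.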
`param_names`	`List[str]`	List of parameter names for the model.
`bounds`	`List[Tuple[float, float]]`	Bounds for the model parameters.
`initial_guess`	`Tuple[float]`	Initial guess for the model parameters.
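`sample_range`	`Tuple[float, float]`	The (min, max) range of x-values from which data is sampled for fitting; both bounds are exclusive. With (None, None), the full range is used.
`sample_size`	`int`	Approximate number of points sampled from each DataItem for fitting.
`model_id_key`	`str`	Key for the model ID column in fitting_table; also used as the prefix of the generated model IDs.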
`fitting_table`	`pd.DataFrame`	Pandas DataFrame storing the fitting results.
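
The calling convention described for `model_func` can be illustrated with a minimal sketch. The function name `thermal_linear`, the variable `temperature`, and the parameters `E` and `k` are hypothetical and chosen only for illustration. The one detail taken from the source is the argument order: the second argument holds the variable values first, followed by the parameters to be fitted.

```python
import numpy as np


def thermal_linear(x, args):
    """Hypothetical model function following the (x, args) convention.

    args is ordered as (variables..., parameters...); here that is
    (temperature, E, k), where temperature would come from a test's info
    and E and k would be fitted.
    """
    temperature, E, k = args
    # Stiffness is reduced linearly with temperature (illustrative only)
    return E * (1.0 - k * temperature) * x


x = np.array([0.0, 0.01, 0.02])
y = thermal_linear(x, (100.0, 70000.0, 0.001))
```

Keeping variables ahead of parameters means the same function can be evaluated for every test: only the leading values change from one test to the next.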

Examples:

>>> import paramaterial as pam
>>> from paramaterial import DataSet, DataItem, ModelSet
>>> model_func = pam.models.linear
>>> param_names = ['E', 's_y']
>>> ms = ModelSet(model_func=model_func, var_names=[], param_names=param_names)  # var_names is a required argument
>>> ds = DataSet(info_path='info.csv', data_dir='data/')
>>> ms.fit_to(ds, x_key='strain', y_key='stress', sample_range=(0.0, 0.05), sample_size=100)
>>> prediction_ds = ms.predict(xmin=0, xmax=0.05)

Source code in paramaterial\modelling.py

class ModelSet:
    """
    Class that acts as a model DataSet, providing functionalities to fit, collect, and predict constitutive models
    for material behavior.

    This class is designed to fit mathematical models to mechanical test data and make predictions based on the fitted
    parameters. It integrates with the DataSet and DataItem classes for handling data.

    Attributes:
        model_func (Callable): A function defining the mathematical model to be fitted. It should accept an array of
                               x-values and a tuple of variables and parameters, and return an array of y-values.
        variable_names (List[str]): List of variable names that may be used in the model_func.
        param_names (List[str]): List of parameter names for the model.
        bounds (List[Tuple[float, float]]): Bounds for the model parameters.
        initial_guess (Tuple[float]): Initial guess for the model parameters.
        sample_range (Tuple[float, float]): Range of samples for fitting.
        sample_size (int): Size of the sample data.
        model_id_key (str): Key for the model ID.
        fitting_table (pd.DataFrame): Pandas DataFrame storing the fitting results.

    Examples:
        >>> import paramaterial as pam
        >>> from paramaterial import DataSet, DataItem, ModelSet
        >>> model_func = pam.models.linear
        >>> param_names = ['E', 's_y']
        >>> ms = ModelSet(model_func=model_func, var_names=[], param_names=param_names)  # var_names is a required argument
        >>> ds = DataSet(info_path='info.csv', data_dir='data/')
        >>> ms.fit_to(ds, x_key='strain', y_key='stress', sample_range=(0.0, 0.05), sample_size=100)
        >>> prediction_ds = ms.predict(xmin=0, xmax=0.05)
    """

    def __init__(self,
                 model_func: Callable[[np.ndarray, Tuple[float]], np.ndarray],
                 var_names: List[str],
                 param_names: List[str],
                 bounds: List[Tuple[float, float]] = None,
                 initial_guess: Tuple[float] = None,
                 sample_range: Tuple[float, float] = (None, None),
                 sample_size: int = 50,
                 model_id_key: str = 'model_id',
                 scipy_func: str = 'minimize',
                 ):
        self.model_func = model_func
        self.variable_names = var_names  # Updated name
        self.param_names = param_names
        self.bounds = bounds
        self.initial_guess = initial_guess if initial_guess else [0.0] * len(param_names)
        self.sample_range = sample_range
        self.sample_size = sample_size
        self.model_id_key = model_id_key
        self.scipy_func = scipy_func
        # self.fitting_table: pd.DataFrame = pd.DataFrame(
        #     columns=[model_id_key] + ['var_' + var_name for var_name in var_names] +
        #             ['param_' + param_name for param_name in param_names] + ['error'])
        self.fitting_table: pd.DataFrame = pd.DataFrame(
            columns=[model_id_key] + ['var_' + var_name for var_name in var_names] +
                    [param_name for param_name in param_names] + ['error'])

    def fit_to(self, ds: DataSet, x_key: str, y_key: str, sample_range: Tuple[float, float] = (None, None),
               sample_size: int = 50, **scipy_method_kwargs):
        """
        Fits the model to a given DataSet using the specified x and y keys for the independent and dependent variables.

        Args:
            ds (DataSet): The DataSet containing the data to be fitted.
            x_key (str): The key for the independent variable (e.g., 'strain').
            y_key (str): The key for the dependent variable (e.g., 'stress').
            sample_range (Tuple[float, float], optional): The range of samples for fitting. Defaults to (None, None).
            sample_size (int, optional): The size of the sample data. Defaults to 50.
            **scipy_method_kwargs: Additional keyword arguments to pass to the SciPy optimization method.

        Returns:
            None: Updates the fitting_table attribute with the fitting results.

        Examples:
            >>> # args holds the variable values first, then the parameters: (a, b, slope, intercept)
            >>> model_func = lambda x, args: args[2] * x + args[3]  # Example linear model
            >>> model_set = ModelSet(model_func, var_names=['a', 'b'], param_names=['slope', 'intercept'])
            >>> ds = DataSet()  # Assume this is a pre-loaded DataSet with 'strain' and 'stress' columns
            >>> model_set.fit_to(ds, x_key='strain', y_key='stress')
        """
        # Set the keys
        self.x_col = x_key
        self.y_col = y_key

        # Call the existing fit method
        self.fit_items(ds, sample_range, sample_size, self.scipy_func, **scipy_method_kwargs)

    def predict(self, x_range: Optional[Tuple[float, float, float]] = None,
                xmin: Optional[float] = None,
                xmax: Optional[float] = None,
                info_table: Optional[pd.DataFrame] = None,
                model_id_key: str = 'model_id'):
        """
        Makes predictions based on the fitted model over a specified x-range.

        Args:
            x_range (Tuple[float, float, float], optional): Range for prediction in the form (xmin, xmax, step).
            xmin (float, optional): Minimum x value for prediction.
            xmax (float, optional): Maximum x value for prediction.
            info_table (pd.DataFrame, optional): Information table containing parameters and variables.
            model_id_key (str, optional): Key for the model ID. Defaults to 'model_id'.

        Returns:
            DataSet: A DataSet containing the predicted values.

        Examples:
            >>> x_range = (0, 10, 0.1)  # Define x range for prediction
            >>> predicted_ds = model_set.predict(x_range=x_range)
        """
        # If x_range is not provided, check for xmin and xmax
        if x_range is None:
            if xmin is None or xmax is None:
                x_range = (0, 0.01, 0.0001)  # Example default value
            else:
                x_range = (xmin, xmax, 0.0001)  # Example step value, adjust as needed

        return self.predict_ds(x_range, info_table, model_id_key)

    def _sample_data(self, di: DataItem) -> Tuple[np.ndarray, np.ndarray]:
        sample_range = self.sample_range
        sample_size = self.sample_size
        x_data = di.data[self.x_col].values
        y_data = di.data[self.y_col].values
        if sample_range[0] is not None and sample_range[1] is not None:
            mask = (x_data > sample_range[0]) & (x_data < sample_range[1])
            x_data, y_data = x_data[mask], y_data[mask]
        sampling_stride = max(int(len(x_data) / sample_size), 1)
        x_data, y_data = x_data[::sampling_stride], y_data[::sampling_stride]
        return x_data, y_data

    def _objective_function(self, params: Tuple[float, ...], di: DataItem) -> float:
        x_data, y_data = self._sample_data(di)
        if self.variable_names is not None:
            variables = tuple(di.info[var_name] for var_name in self.variable_names)
        else:
            variables = ()
        variables_and_params = variables + tuple(params)
        y_model = self.model_func(x_data, variables_and_params)
        return _error_norm(y_data, y_model)

    def _fit_item(self, di: DataItem, scipy_method: str, **scipy_method_kwargs) -> op.OptimizeResult:
        return _call_scipy_method(scipy_method=scipy_method, initial_guess=self.initial_guess, bounds=self.bounds,
                                  objective_function=self._objective_function, storage_object=di, **scipy_method_kwargs)

    def fit_items(self, ds: DataSet, sample_range: Tuple[float, float] = (None, None), sample_size: int = 50,
                  scipy_method: str = 'minimize', **scipy_method_kwargs):

        self.sample_range = sample_range
        self.sample_size = sample_size
        # fit each DataItem and add a row to fitting_table for each
        fitting_dfs = []
        pad = int(np.log10(len(ds))) + 1
        for i, di in enumerate(ds):
            model_id = f'{self.model_id_key}_{i+1:0{pad}}'
            # run optimisation
            fitting_result = self._fit_item(di, scipy_method, **scipy_method_kwargs)
            # extract results
            params = fitting_result.x
            error = fitting_result.fun
            variables = di.info[self.variable_names]
            # add 'var_' prefix to variable names
            variables.index = 'var_' + variables.index
            # add 'param_' prefix to param names
            # params = pd.Series(params, index='param_' + pd.Series(self.param_names))
            params = pd.Series(params, index=pd.Series(self.param_names))
            # add row to fitting_table
            # concatenate data
            data = np.hstack([model_id, variables, params, error, di.info.to_list()]).reshape(1, -1)
            # define columns
            columns = self.fitting_table.columns.tolist() + di.info.index.to_list()
            # create DataFrame and append
            fitting_dfs.append(pd.DataFrame(data, columns=columns))
        # concatenate fitting_dfs into fitting_table
        self.fitting_table = pd.concat(fitting_dfs)

    def predict_ds(self, x_range: Tuple[float, float, float], info_table: Optional[pd.DataFrame] = None,
                   model_id_key: str = 'model_id'):

        if info_table is None:
            info_table = self.fitting_table
        # generate DataItems for each row of info_table
        model_items = []
        for model_id in info_table[model_id_key].to_list():
            di_info = info_table.loc[info_table[model_id_key] == model_id, :].squeeze()
            # extract variables and optimised params from info_table
            variables_keys = self.variable_names if self.variable_names else []
            variables_keys = ['var_' + var_key for var_key in variables_keys]
            # params_keys = ['param_' + param_key for param_key in self.param_names]
            params_keys = [param_key for param_key in self.param_names]
            variables_and_params = di_info[variables_keys + params_keys].to_list()
            # generate model data and create DataItem
            x_model = np.arange(*x_range)
            y_model = self.model_func(x_model, variables_and_params)
            data = pd.DataFrame({self.x_col: x_model, self.y_col: y_model})
            model_items.append(DataItem(model_id, data=data, info=di_info))
        # create DataSet from model_items and return
        ds = DataSet(test_id_key=model_id_key)
        ds.data_items = model_items
        ds.info_table = info_table
        return ds
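
The way `fit_items` labels each fitted model can be sketched in isolation. The helper name `make_model_ids` is hypothetical; the padding rule and the f-string format are copied from the listing above. The sketch assumes at least one item, since the logarithm is undefined for an empty DataSet.

```python
import numpy as np


def make_model_ids(n_items, model_id_key="model_id"):
    """Build zero-padded IDs such as 'model_id_01' for n_items fitted models."""
    # Number of digits needed to write n_items, e.g. 12 -> 2
    pad = int(np.log10(n_items)) + 1
    return [f"{model_id_key}_{i + 1:0{pad}}" for i in range(n_items)]


ids = make_model_ids(12)
```

Zero-padding keeps the IDs in the same order whether they are sorted numerically or as strings.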

`fit_to(ds, x_key, y_key, sample_range=(None, None), sample_size=50, **scipy_method_kwargs)`

Fits the model to a given DataSet using the specified x and y keys for the independent and dependent variables.

Parameters:

Name	Type	Description	Default
`ds`	`DataSet`	The DataSet containing the data to be fitted.	required
`x_key`	`str`	The key for the independent variable (e.g., 'strain').	required
`y_key`	`str`	The key for the dependent variable (e.g., 'stress').	required
`sample_range`	`Tuple[float, float]`	The (min, max) range of x-values from which data is sampled for fitting; both bounds are exclusive. With the default, the full range is used.	`(None, None)`
`sample_size`	`int`	The approximate number of points sampled from each DataItem.	`50`
`**scipy_method_kwargs`		Additional keyword arguments to pass to the SciPy optimization method.	`{}`
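
How `sample_range` and `sample_size` interact can be shown with a standalone sketch that mirrors the masking and striding in `_sample_data`. The function name `sample_xy` and the synthetic arrays are assumptions made for illustration; this is not part of the library.

```python
import numpy as np


def sample_xy(x, y, sample_range=(None, None), sample_size=50):
    """Restrict (x, y) to an open x-interval, then thin it with a fixed stride."""
    lo, hi = sample_range
    if lo is not None and hi is not None:
        # Both bounds are exclusive
        mask = (x > lo) & (x < hi)
        x, y = x[mask], y[mask]
    # Integer stride, never smaller than 1
    stride = max(int(len(x) / sample_size), 1)
    return x[::stride], y[::stride]


# Synthetic data: 1000 evenly spaced points
x = np.arange(1000, dtype=float)
y = 2.0 * x

xs, ys = sample_xy(x, y, sample_range=(99.5, 500.5), sample_size=100)
```

Because the stride is rounded down to an integer, the number of returned points is only approximately `sample_size`; here 401 points in range are thinned with a stride of 4, which leaves 101.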

Returns:

Type	Description
`None`	Updates the fitting_table attribute with the fitting results.

Examples:

>>> # args holds the variable values first, then the parameters: (a, b, slope, intercept)
>>> model_func = lambda x, args: args[2] * x + args[3]  # Example linear model
>>> model_set = ModelSet(model_func, var_names=['a', 'b'], param_names=['slope', 'intercept'])
>>> ds = DataSet()  # Assume this is a pre-loaded DataSet with 'strain' and 'stress' columns
>>> model_set.fit_to(ds, x_key='strain', y_key='stress')
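
The kind of optimisation that `fit_to` delegates to SciPy can be sketched for a single curve. The helpers `_error_norm` and `_call_scipy_method` are not shown on this page, so this sketch assumes a sum-of-squared-residuals objective and calls `scipy.optimize.minimize` directly. The names `linear` and `objective` and the synthetic data are hypothetical.

```python
import numpy as np
from scipy.optimize import minimize


def linear(x, args):
    """Model function in the (x, args) convention, with no variables."""
    slope, intercept = args
    return slope * x + intercept


def objective(params, x, y):
    # Assumed error measure: sum of squared residuals
    residuals = y - linear(x, params)
    return float(np.sum(residuals ** 2))


# Noise-free synthetic data generated from slope=3, intercept=2
x = np.linspace(0.0, 1.0, 50)
y = 3.0 * x + 2.0

# Start from zeros, matching the default initial guess in ModelSet
result = minimize(objective, x0=[0.0, 0.0], args=(x, y))
slope, intercept = result.x
```

In `ModelSet` this step is repeated for every DataItem, and the optimised parameters (`result.x`) and final error (`result.fun`) are collected into one row of `fitting_table`.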

Source code in paramaterial\modelling.py

def fit_to(self, ds: DataSet, x_key: str, y_key: str, sample_range: Tuple[float, float] = (None, None),
           sample_size: int = 50, **scipy_method_kwargs):
    """
    Fits the model to a given DataSet using the specified x and y keys for the independent and dependent variables.

    Args:
        ds (DataSet): The DataSet containing the data to be fitted.
        x_key (str): The key for the independent variable (e.g., 'strain').
        y_key (str): The key for the dependent variable (e.g., 'stress').
        sample_range (Tuple[float, float], optional): The range of samples for fitting. Defaults to (None, None).
        sample_size (int, optional): The size of the sample data. Defaults to 50.
        **scipy_method_kwargs: Additional keyword arguments to pass to the SciPy optimization method.

    Returns:
        None: Updates the fitting_table attribute with the fitting results.

    Examples:
        >>> # args holds the variable values first, then the parameters: (a, b, slope, intercept)
        >>> model_func = lambda x, args: args[2] * x + args[3]  # Example linear model
        >>> model_set = ModelSet(model_func, var_names=['a', 'b'], param_names=['slope', 'intercept'])
        >>> ds = DataSet()  # Assume this is a pre-loaded DataSet with 'strain' and 'stress' columns
        >>> model_set.fit_to(ds, x_key='strain', y_key='stress')
    """
    # Set the keys
    self.x_col = x_key
    self.y_col = y_key

    # Call the existing fit method
    self.fit_items(ds, sample_range, sample_size, self.scipy_func, **scipy_method_kwargs)

`predict(x_range=None, xmin=None, xmax=None, info_table=None, model_id_key='model_id')`

Makes predictions based on the fitted model over a specified x-range.

Parameters:

Name	Type	Description	Default
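`x_range`	`Tuple[float, float, float]`	Range for prediction in the form (xmin, xmax, step). The values are passed to np.arange, so xmax itself is excluded. If x_range is omitted and xmin or xmax is missing, (0, 0.01, 0.0001) is used.	`None`
`xmin`	`float`	Minimum x value for prediction. Used only when x_range is not given and xmax is also set; the step is then 0.0001.	`None`
`xmax`	`float`	Maximum x value for prediction. Used only when x_range is not given and xmin is also set.	`None`
`info_table`	`pd.DataFrame`	Information table containing parameters and variables, one row per model. If None, fitting_table is used.	`None`
`model_id_key`	`str`	Key for the model ID.	`'model_id'`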

Returns:

Type	Description
`DataSet`	A DataSet containing the predicted values.

Examples:

>>> x_range = (0, 10, 0.1)  # Define x range for prediction
>>> predicted_ds = model_set.predict(x_range=x_range)
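
The rules `predict` uses to choose its x-values can be sketched as a standalone helper. The name `resolve_x_range` and its `step` argument are hypothetical; in the source the step of 0.0001 and the fallback range are hard-coded.

```python
import numpy as np


def resolve_x_range(x_range=None, xmin=None, xmax=None, step=0.0001):
    """Return (start, stop, step) following the fallback order used by predict."""
    if x_range is None:
        if xmin is None or xmax is None:
            # Fallback when the range is not fully specified
            x_range = (0, 0.01, step)
        else:
            x_range = (xmin, xmax, step)
    return x_range


# An explicit x_range takes priority over xmin and xmax
explicit = resolve_x_range(x_range=(0, 5, 1), xmin=2, xmax=3)
# The resolved tuple is expanded into np.arange, so the stop value is excluded
x_model = np.arange(*explicit)
```

Note that passing only one of `xmin` and `xmax` has no effect: the fallback range is used unless both are given.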

Source code in paramaterial\modelling.py

def predict(self, x_range: Optional[Tuple[float, float, float]] = None,
            xmin: Optional[float] = None,
            xmax: Optional[float] = None,
            info_table: Optional[pd.DataFrame] = None,
            model_id_key: str = 'model_id'):
    """
    Makes predictions based on the fitted model over a specified x-range.

    Args:
        x_range (Tuple[float, float, float], optional): Range for prediction in the form (xmin, xmax, step).
        xmin (float, optional): Minimum x value for prediction.
        xmax (float, optional): Maximum x value for prediction.
        info_table (pd.DataFrame, optional): Information table containing parameters and variables.
        model_id_key (str, optional): Key for the model ID. Defaults to 'model_id'.

    Returns:
        DataSet: A DataSet containing the predicted values.

    Examples:
        >>> x_range = (0, 10, 0.1)  # Define x range for prediction
        >>> predicted_ds = model_set.predict(x_range=x_range)
    """
    # If x_range is not provided, check for xmin and xmax
    if x_range is None:
        if xmin is None or xmax is None:
            x_range = (0, 0.01, 0.0001)  # Example default value
        else:
            x_range = (xmin, xmax, 0.0001)  # Example step value, adjust as needed

    return self.predict_ds(x_range, info_table, model_id_key)