Train

The default value can be found in DeepH-pack/deeph/default.ini. The following arguments can be set in the configuration file for Train:

basic

  • graph_dir : The directory to save the graph for the dataset.

  • save_dir : The root directory to save the training result.

  • raw_dir : The root directory of the preprocessed dataset.

  • dataset_name : The name of your dataset.

  • disable_cuda : Whether to disable the cuda during training.

  • device : The device you used for training (cpu or cuda:x, where x is the index of the cuda device). If the cuda is disabled by disable_cuda or it is not available in your environment, you may not set this option as it will be automatically set as ‘cpu’.

  • num_threads : The number of threads used for PyTorch on CPU.

  • save_to_time_folder : Whether to create a subfolder named with the current time in save_dir.

  • save_csv : Whether to output labels and predictions for all the structures in the format of csv.

  • tb_writer : Whether to track and visualize the training process by TensorBoard.

  • seed : The seed for generating random numbers.

  • multiprocessing : Number of processes to use multiprocessing to generate crystal graphs. Set to -1 to use all available CPUs. Set to 0 (default) to disable multiprocessing. WARNING: The keyword “num_threads” is incompatible with multiprocessing. If you use multiprocessing with a value of 1 or higher, the crystal graphs generation process will ignore the num_threads keyword. For optimal performance and memory usage, we recommend setting multiprocessing = 0 and adjusting num_threads to control the number of threads. This is because generating crystal graphs can consume large memory.

  • orbital : A JSON format string that defines matrix elements to be predicted. For example, let denotes DFT Hamiltonian matrix element between orbital of atom and orbital of atom . An input of [{"N1 N2": [a1, a2], "N3 N4": [a3, a4], "N5 N6": [a5, a6]}, {"N7 N8": [a7, a8]}] can be set for the orbital option, if you want to predict two matrix elements H1 and H2 for the edge feature of an atom pair , where

    and
    Alternatively, a Python script at DeepH-pack/tools/get_all_orbital_str.py can be used to generate a default configuration to predict all orbitals with one model.

graph

  • create_from_DFT : Whether to use the DFT Hamiltonian matrices to create the graph instead of setting the cut-off radius by hand. It is recommended to set create_from_DFT to True and not to set radius for training.

  • radius : The cut-off radius to create graph. Keyword radius has no effect if create_from_DFT is set to True.

train

  • epochs : The number of passes of the entire training dataset the learning algorithm has completed.

  • pretrained : The path to the pretrained model, e.g. /your/pretrained/model/best_state_dict.pkl.

  • resume : The path to the half-trained model, e.g. /your/half_trained/model/best_state_dict.pkl.

  • train_ratio : The ratio of training data.

  • val_ratio : The ratio of validation data.

  • test_ratio : The ratio of test data.

hyperparameter

  • batch_size : The size of mini-batch.

  • learning_rate : Initial learning rate.

network

  • atom_fea_len : The number of atom features in MPNN layers.

  • edge_fea_len : The number of edge features in MPNN layers.

  • gauss_stop : The stopping radius of basis functions used to represent interatomic distances.

  • num_l : The number of angular quantum numbers that spherical harmonic functions have.

  • distance_expansion : Which basis functions are used to represent interatomic distances. choices = ['GaussianBasis', 'BesselBasis', 'ExpBernsteinBasis']

  • normalization : Which form of normalization layers are used. choices = ['BatchNorm', 'LayerNorm', 'PairNorm', 'InstanceNorm', 'GraphNorm', 'DiffGroupNorm', 'None']

-atom_update_net : Which form of convolutional layers to update atom features are used. choices = ['CGConv', 'GAT', 'PAINN']