Skip to content

Commit

Permalink
Merge pull request #8 from ccdc-opensource/IJS_initial_template
Browse files Browse the repository at this point in the history
Ijs initial template
  • Loading branch information
jonasnyman1982 authored Sep 4, 2024
2 parents f0be7c6 + abc57f6 commit 0cb4f4a
Show file tree
Hide file tree
Showing 18 changed files with 197 additions and 127 deletions.
2 changes: 2 additions & 0 deletions 4-Hydroxyacetophenone/Form_II_data.csv
Original file line number Diff line number Diff line change
@@ -0,0 +1,2 @@
HACTPH10
Identifier, Property, Value (unit), Std, N, Name, Reference , Comment
2 changes: 2 additions & 0 deletions 4-Hydroxyacetophenone/Form_I_data.csv
Original file line number Diff line number Diff line change
@@ -0,0 +1,2 @@
HACTPH12
Identifier, Property, Value (unit), Std, N, Name, Reference , Comment
18 changes: 9 additions & 9 deletions Benzophenone/Monoclinic_beta_enthalpy_of_fusion.csv
Original file line number Diff line number Diff line change
@@ -1,10 +1,10 @@
BPHENO03,,,,,,,
Identifier,Property,Value (kJ/mol)," Std"," N", Name,Reference ," Comment"
1, Enthalpy of fusion,13.8, ,, Dudek, M_Dudek_2024.1,
2, Enthalpy of fusion,13.8, ,, Dudek, M_Dudek_2024.1,
3, Enthalpy of fusion,14.3, ,, Dudek, M_Dudek_2024.1,
4, Enthalpy of fusion,14.2, ,, Dudek, M_Dudek_2024.1,
5, Enthalpy of fusion,13.8, ,, Dudek, M_Dudek_2024.1,
6, Enthalpy of fusion,14.1, ,, Dudek, M_Dudek_2024.1,
7, Enthalpy of fusion,13.8, ,, Neumann, https://doi.org/10.1515/zpch-1932-16103 ,
8, Enthalpy of fusion,14.5,0.15,, Stejfa, https://doi.org/10.1021/acs.jced.6b00523,
Identifier, Property, Value (kJ/mol), Std, N, Name, Reference , Comment
1, Enthalpy of fusion,13.8, , , Dudek, M_Dudek_2024.1,
2, Enthalpy of fusion,13.8, , , Dudek, M_Dudek_2024.1,
3, Enthalpy of fusion,14.3, , , Dudek, M_Dudek_2024.1,
4, Enthalpy of fusion,14.2, , , Dudek, M_Dudek_2024.1,
5, Enthalpy of fusion,13.8, , , Dudek, M_Dudek_2024.1,
6, Enthalpy of fusion,14.1, , , Dudek, M_Dudek_2024.1,
7, Enthalpy of fusion,13.8, , , Neumann, https://doi.org/10.1515/zpch-1932-16103 ,
8, Enthalpy of fusion,14.5,0.15, , Stejfa, https://doi.org/10.1021/acs.jced.6b00523,
20 changes: 10 additions & 10 deletions Benzophenone/Monoclinic_beta_melting_point.csv
Original file line number Diff line number Diff line change
@@ -1,11 +1,11 @@
BPHENO03,,,,,,,
Identifier," Property"," Value (K)"," Std"," N", Name, Reference ," Comment"
1," Melting point",299.04," ",1,Dudek,M_Dudek_2024.1,
2," Melting point",299.08," ",1,Dudek,M_Dudek_2024.1,
3,Melting point,299.18, ,1,Dudek,M_Dudek_2024.1,
4,Melting point,299.02, ,1,Dudek,M_Dudek_2024.1,
5,Melting point,299.29, ,1,Dudek,M_Dudek_2024.1,
6, Melting point,299.32, ,1,Dudek,M_Dudek_2024.1,
7,Melting point,299.25, ,1,Neumann, https://doi.org/10.1515/zpch-1932-16103 ,
8,Melting point,298.3,0.15,,Stejfa, https://doi.org/10.1021/acs.jced.6b00523 ,
9, Melting point,298.7,0.8,,Rietveld, https://doi.org/10.1038/s41598-023-38985-y ,
Identifier, Property, Value (K), Std, N, Name, Reference , Comment
1, Melting point, 299.04, , 1, Dudek,M_Dudek_2024.1,
2, Melting point, 299.08, , 1, Dudek,M_Dudek_2024.1,
3, Melting point, 299.18, , 1, Dudek,M_Dudek_2024.1,
4, Melting point, 299.02, , 1, Dudek,M_Dudek_2024.1,
5, Melting point, 299.29, , 1, Dudek,M_Dudek_2024.1,
6, Melting point, 299.32, , 1, Dudek,M_Dudek_2024.1,
7, Melting point, 299.25, , 1, Neumann, https://doi.org/10.1515/zpch-1932-16103 ,
8, Melting point, 298.3, 0.15, , Stejfa, https://doi.org/10.1021/acs.jced.6b00523 ,
9, Melting point, 298.7, 0.8, , Rietveld, https://doi.org/10.1038/s41598-023-38985-y ,
2 changes: 1 addition & 1 deletion Benzophenone/README.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,3 @@
# System Template

This is the folder for recording physcal property data for the Benzophenone system.
This is the folder for recording physical property data for the Benzophenone system.
10 changes: 5 additions & 5 deletions Febuxostat/Form_III_H1_triclinic_Melting_point.csv
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
HIQQAB02,,,,,,,
Identifier,Property,Value (K)," Std"," N", Name,Reference ," Comment"
1,Melting Point,476.49,,,Pop,Pop_2024_COST_data,30K/Min
2,Melting Point,477.43,,,Pop,Pop_2024_COST_data,30K/Min
3,Melting Point,477.91,,,Pop,Pop_2024_COST_data,40K/Min
4,Melting Point,478.34,,,Pop,Pop_2024_COST_data,40K/Min
Identifier, Property, Value (K), Std, N, Name, Reference , Comment
1, Melting Point, 476.49, , , Pop, Pop_2024_COST_data, 30K/Min
2, Melting Point, 477.43, , , Pop, Pop_2024_COST_data, 30K/Min
3, Melting Point, 477.91, , , Pop, Pop_2024_COST_data, 40K/Min
4, Melting Point, 478.34, , , Pop, Pop_2024_COST_data, 40K/Min
10 changes: 5 additions & 5 deletions Febuxostat/Form_II_Q_monoclinic_Melting_point.csv
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
HIQQAB,,,,,,,
Identifier,Property,Value (K)," Std"," N", Name,Reference ," Comment"
1,Melting Point,481,,,Pop,Pop_2024_COST_data,30K/Min
2,Melting Point,482.09,,,Pop,Pop_2024_COST_data,30K/Min
3,Melting Point,481.78,,,Pop,Pop_2024_COST_data,40K/Min
4,Melting Point,483.32,,,Pop,Pop_2024_COST_data,40K/Min
Identifier, Property, Value (K), Std, N, Name, Reference , Comment
1, Melting Point, 481, , , Pop, Pop_2024_COST_data, 30K/Min
2, Melting Point, 482.09, , , Pop, Pop_2024_COST_data, 30K/Min
3, Melting Point, 481.78, , , Pop, Pop_2024_COST_data, 40K/Min
4, Melting Point, 483.32, , , Pop, Pop_2024_COST_data, 40K/Min
12 changes: 6 additions & 6 deletions Febuxostat/Form_I_A_orthorhombic_Melting_point.csv
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
HIQQAB06,,,,,,,
Identifier,Property,Value (K)," Std"," N", Name,Reference ," Comment"
1,Melting Point,482.98,,,Teracrystal,Pop_2024_COST_data,10K/Min
2,Melting Point,486.43,,,Pop,Pop_2024_COST_data,30K/Min
3,Melting Point,486.63,,,Pop,Pop_2024_COST_data,30K/Min
4,Melting Point,487.85,,,Pop,Pop_2024_COST_data,40K/Min
5,Melting Point,488.19,,,Pop,Pop_2024_COST_data,40K/Min
Identifier, Property, Value (K), Std, N, Name, Reference , Comment
1, Melting Point, 482.98, , ,Teracrystal, Pop_2024_COST_data, 10K/Min
2, Melting Point, 486.43, , ,Pop, Pop_2024_COST_data, 30K/Min
3, Melting Point, 486.63, , ,Pop, Pop_2024_COST_data, 30K/Min
4, Melting Point, 487.85, , ,Pop, Pop_2024_COST_data, 40K/Min
5, Melting Point, 488.19, , ,Pop, Pop_2024_COST_data, 40K/Min
13 changes: 13 additions & 0 deletions Phenylpiracetam/Form_A_data.csv
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
SANBAN
Identifier, Property, Value (K), Std, N, Name, Comment, Reference
1, Melting point, 383.55, , , Kons, 20 K/min, Artis Kons e-mail 3 Sept 2024
2, Melting point, 383.65, , , Kons, 20 K/min,
3, Melting point, 383.19, , , Kons, 10 K/min,
4, Melting point, 383.05, , , Kons, 10 K/min,
5, Melting point, 382.91, , , Kons, 10 K/min,
6, Melting point, 383.25, , , Kons, 5 K/min,
7, Melting point, 384.55, , , Kons, 5 K/min,
8, Melting point, 382.37, , , Kons, 1 K/min,
9, Melting point, 381.95, , , Kons, 1 K/min,
10, Melting point, 381.3 , , , Kons, 1 K/min,
11, Melting point, 382.35, , , Rekis_17, , 10.1021/acs.cgd.6b01867
8 changes: 8 additions & 0 deletions Phenylpiracetam/Form_A_enthalpy_melting.csv
Original file line number Diff line number Diff line change
@@ -0,0 +1,8 @@
SANBAN
Identifier, Property, Value (kJ/mol), Std, N, Name, Comment, Reference
1 , Enthalpy of fusion, 26.80838, , , Kons, Artis Kons e-mail 2 Sept 2024,
2 , Enthalpy of fusion, 26.53993, , , Kons, Artis Kons e-mail 2 Sept 2024,
3 , Enthalpy of fusion, 25.66036, , , Kons, Artis Kons e-mail 2 Sept 2024,
4 , Enthalpy of fusion, 25.93973, , , Kons, Artis Kons e-mail 2 Sept 2024,
5 , Enthalpy of fusion, 25.7673, , , Kons, Artis Kons e-mail 2 Sept 2024,
6 , Enthalpy of fusion, 25.1, 0.6, , Rekis_17, , 10.1021/acs.cgd.6b0186
14 changes: 14 additions & 0 deletions Phenylpiracetam/Form_B_enthalpy_melting.csv
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
Phenylpiracetam Form B
Identifier, Property, Value (kJ/mol), Std, N, Name, Comment, Reference
1 , Enthalpy of fusion, 23.94923, , , , Kons, Artis Kons e-mail 3 Sept 2024,
2 , Enthalpy of fusion, 24.34864, , , , Kons, Artis Kons e-mail 3 Sept 2024,
3 , Enthalpy of fusion, 24.63892, , , , Kons, Artis Kons e-mail 3 Sept 2024,
4 , Enthalpy of fusion, 24.32681, , , , Kons, Artis Kons e-mail 3 Sept 2024,
5 , Enthalpy of fusion, 24.55162, , , , Kons, Artis Kons e-mail 3 Sept 2024,
6 , Enthalpy of fusion, 25.45083, , , , Kons, Artis Kons e-mail 3 Sept 2024,
7 , Enthalpy of fusion, 25.15619, , , , Kons, Artis Kons e-mail 3 Sept 2024,
8 , Enthalpy of fusion, 24.95103, , , , Kons, Artis Kons e-mail 3 Sept 2024,
9 , Enthalpy of fusion, 23.92959, , , , Kons, Artis Kons e-mail 3 Sept 2024,
10 , Enthalpy of fusion, 23.32284, , , , Kons, Artis Kons e-mail 3 Sept 2024,
11 , Enthalpy of fusion, 23.97760, , , , Kons, Artis Kons e-mail 3 Sept 2024,
12 , Enthalpy of fusion, 22.4 , 0.3, , , Rekis_17, , 10.1021/acs.cgd.6b0186
13 changes: 13 additions & 0 deletions Phenylpiracetam/Form_B_melting.csv
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
Phenylpiracetam Form B
Identifier, Property, Value (K), Std, N, Name, Comment, Reference
1 , Melting point, 390.42, , , Kons, 20 K/min , Artis Kons e-mail 3 Sept 2024
2 , Melting point, 391.28, , , Kons, 20 K/min , Artis Kons e-mail 3 Sept 2024
3 , Melting point, 388.57, , , Kons, 20 K/min , Artis Kons e-mail 3 Sept 2024
4 , Melting point, 389.10, , , Kons, 20 K/min , Artis Kons e-mail 3 Sept 2024
5 , Melting point, 389.22, , , Kons, 20 K/min , Artis Kons e-mail 3 Sept 2024
6 , Melting point, 389.59, , , Kons, 20 K/min , Artis Kons e-mail 3 Sept 2024
7 , Melting point, 389.02, , , Kons, 20 K/min , Artis Kons e-mail 3 Sept 2024
8 , Melting point, 390.77, , , Kons, 20 K/min , Artis Kons e-mail 3 Sept 2024
9 , Melting point, 390.00, , , Kons, 20 K/min , Artis Kons e-mail 3 Sept 2024
10 , Melting point, 390.30, , , Kons, 20 K/min , Artis Kons e-mail 3 Sept 2024
11 , Melting point, 390.29, , , Kons, 20 K/min , Artis Kons e-mail 3 Sept 2024
57 changes: 50 additions & 7 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,12 +1,55 @@
# collaboration-bestcsp-experimental-data

Sharing of experimental data for validating computational methods as part of the BEST-CSP project, see <https://www.cost.eu/actions/CA22107> and <https://best-csp.eu/>
Sharing of experimental data for validating computational methods as part of the BEST-CSP project,
see <https://www.cost.eu/actions/CA22107> and <https://best-csp.eu/>

This repository contains individual folders for each of the systems considered in this Action, and an example "System_template" to illustrate the format
This repository contains individual folders for each of the compounds considered in this Action,
and an example "System_template" to illustrate the format.

Within each folder are .csv files for each polymorph, for standardised recording of data. Keeping the experimental data
to a standard format will aid model building/ statistical assessment of results, and csv files are machine readable
Within each folder are .csv files for each polymorph, for standardised recording of data.
Keeping the experimental data to a standard format will aid in the statistical treatment of results,
and csv files are machine readable, facilitating plotting and tabulating the data.

There is also a python script for deriving consensus means from data generated in different labs, in the top directory, called stats.py.
It requires the modules statsmodules and matplotlib; it may be simpler to make a conda env and install those modules, but I have included
environment.yaml as a snapshot for the pictures used in the Paris meeting (20/06/2024)
There is also a Python script in the top directory for calculating consensus means from data generated in different labs called stats.py.
The script requires the modules statsmodels and matplotlib; it may be simpler to make a conda environment and install those modules,
but I have included environment.yaml as a snapshot for the pictures used in the Paris meeting (20/06/2024).

## Filling in CSV files

In writing the csv files, please consider the following guidance.
Each line can contain either a single data point, or the mean value and standard deviation of several data points from the same experiment/lab.
It is very important that the uncertainty or error bar written in the csv file is one sample standard deviation.
If you calculate it yourself, make sure to use the formula for sample standard deviation, with n-1 in the denominator.
In literature, other kinds of error bars are often used, and it is important that you try to figure out what kind of error bar it is.
If the paper doesn't specify, we assume that it is one std.
All data we have should be entered into the CSV. Outliers and questionable data should also be included for the sake of transparancy and completeness.
We then exclude such data by commenting out lines, and adding a statement about why that data point was rejected.
It is important that the entered data can be traced back to its origin. For literature, DOI or other reference that makes it possible
to find the study should be added.

What the Python script does:
If there are several data points from the same lab/name, the script will assume that they
are all from the same experiment and merge the data into a mean value and standard deviation.
If there are several literature studies that have only reported a single number without error bar,
the script will merge them and treat the data from all of them as one additonal laboratory.

DSC and other thermal data:
Melting temperature (temperature of fusion) should be the onset of the melting endotherm.
Values in J/g should be converted to kJ/mol. Take care in finding the correct molar mass.
We only use SI units. All temperatures should be in Kelvin.

The first line in each CSV file contains a CSD reference specifying a crystal structure.
This is a prototypical structure that defines which compound and polymorph the data is for.
Make sure that the data you enter are for that polymorph.

Each line in the csv file then contaisn the following fields, separated by commas:

- Id: We number the data, so that it becomes easier to refer to a specific data point by specifying the filename and row number.
- Physical property: Some description of what was measured, the physical property.
- Value: This is the measured value for the physical property at that temperature, pressure. The units should be in the column heading
- Std: The sample standard deviation, if available.
- N: The number of data points, if a mean value was entered.
- Temperature: (Optional) if the Property is temperature dependent (i.e. Heat Capacity at 290K etc.,) then add a column here
- Name: The name of the scientist or the PI of the lab that did the measurements. The main importance is traceability, we need to keep track of where tha data came from.
For literature data, add the year it was published as Name_YY. For newly collected data, omit the year.
- Comment, Ref: Free text comments, DOI, experimental conditions or other info.
36 changes: 18 additions & 18 deletions Sulfamerazine/Form_one_enthalpy_of_fusion.csv
Original file line number Diff line number Diff line change
@@ -1,18 +1,18 @@
SLFNMA02,,,,,,,,,,,,,,,,,,,
Identifier,Property,Value(kJ/mol),Std,N,Name,Reference,Comment,,,,,,,,,,,,
1,Enthalpy of fusion,39.28,,1,Wood,W_Wood_Innsbruck_2024,10K/min,,,,,,,,,,,,
2,Enthalpy of fusion,38.67,,1,Wood,W_Wood_Innsbruck_2024,10K/min,,,,,,,,,,,,
3,Enthalpy of fusion,37.95,,1,Wood,W_Wood_Innsbruck_2024,10K/min,,,,,,,,,,,,
4,Enthalpy of fusion,37.66,,1,Wood,W_Wood_Innsbruck_2024,10K/min,,,,,,,,,,,,
5,Enthalpy of fusion,37.37,,1,Wood,W_Wood_Innsbruck_2024,10K/min,,,,,,,,,,,,
6,Enthalpy of fusion,38.14,,1,Wood,W_Wood_Innsbruck_2024,10K/minafterFII->Itransition,,,,,,,,,,,,
7,Enthalpy of fusion,38.48,,1,Wood,W_Wood_Innsbruck_2024,10K/minafterFII->Itransition,,,,,,,,,,,,
8,Enthalpy of fusion,37.64,,1,Wood,W_Wood_Innsbruck_2024,10K/minafterFII->Itransition,,,,,,,,,,,,
9,Enthalpy of fusion,39.57,,1,Wood,W_Wood_Innsbruck_2024,10K/minafterFII->Itransition,,,,,,,,,,,,
10,Enthalpy of fusion,37.98,,1,Wood,W_Wood_Innsbruck_2024,10K/minafterFII->Itransition,,,,,,,,,,,,
11,Enthalpy of fusion,40.8,0.2,3,Zhang_02,https://doi.org/10.1002/jps.10100,,,,,,,,,,,,,
12,Enthalpy of fusion,36.3,1,6,Yang_72,https://doi.org/10.1002/jps.2600610104,10K/min,,,,,,,,,,,,
13,Enthalpy of fusion,45.8,,1,Khattab_83,https://doi.org/10.1016/0040-6031(83)80280-9,2K/min,,,,,,,,,,,,
14,Enthalpy of fusion,40.17,0.872410454,5,Li_11,https://doi.org/10.1016/j.ijpharm.2011.05.058,2K/min,,,,,,,,,,,,
15,Enthalpy of fusion,39.4,,1,Caria_92,https://doi.org/10.1107/S0108768192000910,Theenthalpyandentropyoffusionfromrepeatedrunsyieldedrangesof38.5-40.3kJ/moland75.6-79.1J/K/molrespectively,,,,,,,,,,,,
16,Enthalpy of fusion,41.1,,1,Maury_86,https://doi.org/10.1002/jps.2600740411,,,,,,,,,,,,
SLFNMA02,,,,,,,
Identifier,Property,Value (kJ/mol),Std,N,Name,Reference,Comment
1,Enthalpy of fusion,39.28,,1,Wood,W_Wood_Innsbruck_2024,10K/min
2,Enthalpy of fusion,38.67,,1,Wood,W_Wood_Innsbruck_2024,10K/min
3,Enthalpy of fusion,37.95,,1,Wood,W_Wood_Innsbruck_2024,10K/min
4,Enthalpy of fusion,37.66,,1,Wood,W_Wood_Innsbruck_2024,10K/min
5,Enthalpy of fusion,37.37,,1,Wood,W_Wood_Innsbruck_2024,10K/min
6,Enthalpy of fusion,38.14,,1,Wood,W_Wood_Innsbruck_2024,10K/minafterFII->Itransition
7,Enthalpy of fusion,38.48,,1,Wood,W_Wood_Innsbruck_2024,10K/minafterFII->Itransition
8,Enthalpy of fusion,37.64,,1,Wood,W_Wood_Innsbruck_2024,10K/minafterFII->Itransition
9,Enthalpy of fusion,39.57,,1,Wood,W_Wood_Innsbruck_2024,10K/minafterFII->Itransition
10,Enthalpy of fusion,37.98,,1,Wood,W_Wood_Innsbruck_2024,10K/minafterFII->Itransition
11,Enthalpy of fusion,40.8,0.2,3,Zhang_02,https://doi.org/10.1002/jps.10100,
12,Enthalpy of fusion,36.3,1,6,Yang_72,https://doi.org/10.1002/jps.2600610104,10K/min
13,Enthalpy of fusion,45.8,,1,Khattab_83,https://doi.org/10.1016/0040-6031(83)80280-9,2K/min
14,Enthalpy of fusion,40.17,0.872410454,5,Li_11,https://doi.org/10.1016/j.ijpharm.2011.05.058,2K/min
15,Enthalpy of fusion,39.4,,1,Caria_92,https://doi.org/10.1107/S0108768192000910,Theenthalpyandentropyoffusionfromrepeatedrunsyieldedrangesof38.5-40.3kJ/moland75.6-79.1J/K/molrespectively
16,Enthalpy of fusion,41.1,,1,Maury_86,https://doi.org/10.1002/jps.2600740411,
8 changes: 4 additions & 4 deletions Sulfamerazine/Form_two_melting_point.csv
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
SLFNMA01,,,,,,,,,,,
Identifier," Property"," Value (K)"," Std"," N"," Name"," Reference ",Comment,,,,
1," Melting point",486.15,,1,Zhang_02," https://doi.org/10.1002/jps.10100"," ",,,,
2," Melting point",485.60,0.387298335,4,Li_11," https://doi.org/10.1016/j.ijpharm.2011.05.058",collected results at different heating rates,,,,
SLFNMA01,,,,,,,
Identifier," Property"," Value (K)"," Std"," N"," Name"," Reference ",Comment
1,Melting point,486.15,,1,Zhang_02," https://doi.org/10.1002/jps.10100"," "
2,Melting point,485.6,0.387298335,4,Li_11," https://doi.org/10.1016/j.ijpharm.2011.05.058",collected results at different heating rates
Loading

0 comments on commit 0cb4f4a

Please sign in to comment.