diff --git a/43_nn.tex b/43_nn.tex index 47d45d9..f70fc53 100644 --- a/43_nn.tex +++ b/43_nn.tex @@ -95,7 +95,7 @@ After the training, the resulting model is saved in a small \texttt{HDF5} file w \subsection{Results} -The output of the Neural Network (Figure \ref{fig:nnresults}) looks quite \todo{end sentence} +The output of the Neural Network (Figure \ref{fig:nnresults}) looks quite similar to the RBF output. There seem to be very few details visible, but just like in the other results the transition from high water fraction to low water fraction is clearly visible. \begin{figure} \centering @@ -117,8 +117,8 @@ The output of the Neural Network (Figure \ref{fig:nnresults}) looks quite \todo{ \end{figure} % !TeX spellcheck = en_US -\begin{table} - \centering +\begin{table}[b] + \centering \begin{tabular}{rcc} & {mean squared error} & {mean error} \\ griddata (only original data) & 0.014 & 0.070 \\ diff --git a/50_conclusion.tex b/50_conclusion.tex index c0ddf37..390b9ae 100644 --- a/50_conclusion.tex +++ b/50_conclusion.tex @@ -1,14 +1,15 @@ +% !TeX spellcheck = en_US \chapter{Comparison and Conclusion} \label{sec:comparison} -All three methods for interpolation described above give results that follow the rough correlations from Section \ref{sec:cov}. So to compare them more precisely and measure their accuracy an additional set of 100 simulations (with the same properties as the ones listed in Section \ref{sec:resimulation}) was created. These results are neither used to train or select the neural network, nor are in the dataset for griddata and RBF interpolation. Therefore, we can use them to generate predictions for their parameters and compare them with the real fraction of water that remained in those simulations. By taking the mean absolute difference and the mean squared error between the predictions and the real result, the accuracy of the different methods can be estimated (Table \ref{tab:comparison}). As one of these parameter sets is outside the convex hull of the training data and griddata can't extrapolate, this simulation is skipped and only the remaining 99 simulations are considered for the griddata accuracy calculation. +All three methods for interpolation described above give results that follow the rough correlations from Section \ref{sec:cov}. So to compare them more precisely and measure their accuracy, an additional set of 100 simulations (with the same properties as the ones listed in Section \ref{sec:resimulation}) was created. These results are neither used to train or select the neural network, nor are in the dataset for griddata and RBF interpolation. Therefore, we can use them to generate predictions for their parameters and compare them with the real fraction of water that remained in those simulations. By taking the mean absolute difference and the mean squared error between the predictions and the real result, the accuracy of the different methods can be estimated (Table \ref{tab:comparison}). As one of these parameter sets is outside the convex hull of the training data and griddata can't extrapolate, this simulation is skipped and only the remaining 99 simulations are considered for the griddata accuracy calculation. Of the three methods, the trained neural network has the highest mean squared error. This seems to be at least partly caused by the fact that during training of the neural network, the data is strongly generalized, causing the final network to output the \enquote{smoothest} interpolations. While this causes the errors to be higher, it might be possible that the fine structured details in the simulation output are just an artifact of the simulation setup and doesn't represent real world collisions. Another important aspect to compare is the interpolation speed. The neural network is able to give the 100 results in about \SI{4}{\milli\second} (after loading the trained model which takes approximately one second). RBF interpolation is still reasonably fast, taking about \SI{8.5}{\second} (\SI{85}{\milli\second} per interpolation). But as \texttt{griddata} expects a grid-based parameter space, it becomes really slow when adding the resimulation data with random parameters. A single interpolation takes about \SI{35}{\second} totaling to about an hour for all 99 test cases. Using only the original dataset brings the run time down to around \SI{10}{\second}, but causes the results to be less accurate than all other methods. (first row in Table \ref{tab:comparison}) -Interpolation using Radial Basis Functions all in all seems to be the most reliable method if there is enough input data and this input data is mostly spread randomly across the parameter space. It is easy to implement and quite fast to execute while still giving reasonable results. Neural Networks can also create realistic output, but have lots more configurable parameters that need to be tuned to get usable results. Their main advantage is, that in case the input set was by magnitudes larger, only the training would take longer, while evaluating the trained model wouldn't change. +Interpolation using Radial Basis Functions all in all seems to be the most reliable method if there is enough input data and this input data is mostly spread randomly across the parameter space. It is easy to implement and quite fast to execute while still giving reasonable results. Neural Networks can also provide realistic output, but have lots more configurable parameters that need to be tuned to get usable results. Their main advantage would be more noticeable if the input set was by magnitudes larger. In this case only the training would take longer, while evaluating the trained model wouldn't change. -To sum up, it is possible to estimate the amount of water lost in two-body-collisions with arbitrary collision parameters by simulating the outcome of a large amount of collisions using \texttt{SPH} and then doing linear interpolations to get results for parameters in between the ones from the simulation set. While the amount of remaining water is overestimated in this analysis as thermal effects during the collision are ignored, but the results are better than a perfect merging assumption. +To sum up, it is possible to estimate the amount of water lost in two-body-collisions with arbitrary collision parameters by simulating the outcome of a large amount of collisions using \texttt{SPH} and then doing linear interpolations to get results for parameters in between the ones from the simulation set. While the amount of remaining water is overestimated in this analysis as thermal effects during the collision are ignored, the results are better than a perfect merging assumption. diff --git a/60_massretention.tex b/60_massretention.tex index fa1efe3..a99890e 100644 --- a/60_massretention.tex +++ b/60_massretention.tex @@ -1,9 +1,9 @@ % !TeX spellcheck = en_US \chapter{Core mass retention} -While this thesis focuses on the water retention after the collisions, the same methods can be applied to the fraction of basalt from the core of the two bodies that remains after the collision. Using the same parameters for interpolation and a separately trained model with the same parameters results in similar results as for water retention. When plotted just like before in Figure \ref{fig:mass_results}, one can see that the results are quite similar. The main difference is that on average there is a slightly higher core mass retention, which can be explained by the fact that weaker collisions might be strong enough to throw the outer water layer into space, but keep the core intact. In addition, it seems like the border between high and low core mass retention is smaller.\todo{better phrase} +While this thesis focuses on the water retention after the collisions, the same methods can be applied to the fraction of basalt from the core of the two bodies that remains after the collision. Using the same parameters for interpolation and a separately trained model with the same parameters results in similar results as for water retention. When plotting the same scenarios as in the previous chapters (Figure \ref{fig:mass_results}), one can see that the results are quite similar. The main difference is that on average there is a slightly higher core mass retention, which can be explained by the fact that weaker collisions might be strong enough to throw the outer water layer into space, but keep the core intact. In addition, it seems like the transition between high and low core mass retention is faster. -When applying the same comparison as described in Chapter \ref{sec:comparison} the interpolations seem to have a lower accuracy, but still RBF interpolation gives the best results considering slow speed of griddata. +When applying the same comparison as described in Chapter \ref{sec:comparison} the interpolations seem to have a lower accuracy, but still RBF interpolation gives the best results considering the slow speed of griddata. \begin{table} diff --git a/bibliographie.bib b/bibliographie.bib index f45fe68..80847fd 100644 --- a/bibliographie.bib +++ b/bibliographie.bib @@ -182,8 +182,6 @@ journaltitle = {Computers and Mathematics with Applications}, date = {2019-08}, pages = {submitted}, - eprint = {0705.2672}, - eprinttype = {arXiv}, keywords = {smooth particle hydrodynamics, GPU-computing, hydrodynamics, continuum mechanics}, }