Hi there helpful people,
I’ve run some experiments and have a few types of data relating to sections of tissue. We call this multi-modal sample analysis. One data set consists of a 3D array of hyperspectral data (128 x 128 pixels x 800 wavenumbers/bands), and each wavenumber has an absorbance value associated with it. I also have matching visible images which I plan to load as array data for visible image analysis. My goal is to integrate the hyperspectral data and the visible image data for analysis.
I’m considering loading the hyperspectral data into SciDB as a 2D array with two attributes (wavenumber and absorbance). Then, creating another 2D array (x=128, y=128 pixels) with a second set of attributes to capture the visible image data. With this design the wavenumber attribute will have the same 800 values repeated for each pixel in the hyperspectral image. The arrays could then be aligned by the pixel coordinates (x, y).
Is this a good/bad design?! Are there reasons to load the hyperspectral data as a 3D array (128 x 128 x 800) with a single attribute (absorbance)?
I have 40 odd samples which I plan to analyse in one large dataset. Should I create a different array for each sample, or try to combine the samples into a single array somehow?
I would appreciate any advice or examples that are available.