I am a R user and have been playing with R package scidb for a month or so. It is great to have a database back-end to manage data for analysis in R or any other software.
In scidb dimension indices are either integer (like i = 1…n) or string (non-integer dimension). On the other hand, R matrix or data.frame allow both indexing by integer or string (optional). Since it is possible in scidb to map string index to integer (not in order any more), the mechanism already exist for coexistence of integer index and string index. In fact there is no need to have non-integer dimensions (they are all integer anyway), but users can assign a string name to each index. Another advantage would be selection of a range would also work for non-integer dimensions (if their order is preserved).
My plan is to use scidb as a data warehouse for various formats of high-through put data, like genetics, mass spectrometry, transcriptomics, etc. There are usually hundreds to millions of variables, ordered (by mass, retention time, etc), 2 dimensional arrays. I want to be able to select a range of variables and also to see their names easily.
Another question here, is it possible to share 1000 genome project scidb use case? I also would like to have web interface to some of the data in scidb.