Re: BMRB heteronuclear relaxation records together with multiple spin types. -- February 27, 2009

Hi,

I have tried to provide more information below. I am reworking the crossrelaxation portions of the dictionary and will let you know when it isavailable. Should not be but a few days.


Cheers,
Eldon



Edward d'Auvergne wrote:

Hi,

I have to apologise for the very basic questions - my understanding of
the NMR-STAR dictionary is very basic at the moment and I'm trying to
learn it as fast as possible!  And converting all data types from
relax into NMR-STAR v3.1 format and then loading it back again into
relax requires a very deep understanding.  My design is a generic API
which interfaces relax to the pystarlib library, the former containing
the data you would like and the later writing and reading basic STAR
formatted files.  This API will be a package - separable from relax,
so maybe interesting for you for other purposes - which implements
reading and writing of the NMR-STAR dictionary v3.1.  It is designed
to handle different versions of the NMR-STAR dictionary, and the
incomplete code is currently located in the 'bmrblib' directory in the
bmrb relax branch (http://svn.gna.org/viewcvs/relax/).  What I would
like for this API is that the calling program dumps the data to it in
arrays, floats, etc., and the library creates a partial v3.1 saveframe
(including required tags, tags necessary for smooth 2 way data flow,
and any other useful, non-internal tags).  A complementary set of
functions returns the data from a NMR-STAR file.  There may be a lot
of overlap with CCPN here, but as the CCPN data model is not so good
with dynamics data (yet!), I think this is the best way forward at
this point.  More below...

The work you are doing should be very useful and when complete we wouldbe pleased if you would allow us to make the API package available fromthe BMRB web site.


On Wed, Feb 25, 2009 at 10:57 PM, Eldon Ulrich <elu@xxxxxxxxxxxxx> wrote:

Hi Edward,

Sorry for the delayed response. The file looks like a good start. I have
provided answers below.


Once the basic infrastructure for this API is in place, generating the
saveframes for all the dynamics data will be trivial.

Thanks,
Eldon

Edward d'Auvergne wrote:

Dear Eldon,

relax is now producing NMR-STAR dictionary files containing relaxation
data (one truncated example is attached), although these are obviously
far from complete.  I have a few questions as this does not work for
all molecule types.  For example if you have a protein or RNA system
and have collected both 15N and 13C R1 relaxation data.  Should these
go into 2 separate heteronucl_T1_relaxation saveframes?

In the past 15N and 13C relaxation data would be put in separate saveframes,
but if the new 'coherence_type' values are used all data of the same
coherence type could be included in one table. The atom names are required.


Which would you prefer I implement right now if people in a few months
time would like, or are asked, to submit dynamics data?  For BMRB
submission, I would assume that relax's NMR-STAR file generation need
not be 100% complete?  Oh, is this new 'coherence_type' part of v3.1
or will it be a future version?

I would prefer that you implement the new version of the coherence typevalues. This seems to be a better way to go. Do you agree?

 Or if you are

studying a protein with 15N labelling and have data for both the
backbone amide and the Trp indole ring?

Here, the data can be in the same table for sure, you just need to provide
the atom name as a value to the tag _T1.Atom_ID. This tag should have the
value of 'N' for backbone nitrogen in any case. We do not assume an atom
name. This will be especially true if you use the new values for coherence
type (Iz, etc.) we have discussed.


I might ask later how to get this structure info into the entity saveframes.

And is there more information

on how to construct the molecular entities other than at
http://www.bmrb.wisc.edu/deposit/mol_assembly/?

The molecular assembly is made up of the polymer chains and ligands andcofactors that are tightly bound (hemes, iron sulfur clusters, etc.) orexchanging between bound and free states. In general, things likebuffers, solvents, and lipids are not considered part of the molecularassembly. Each component in the assembly is given an ID(_Entity_assembly.ID). Both a homodimer and a heterodimer assembly wouldhave two _Entity_assembly.ID values, one for each polymer chain in theassembly. For each unique polymer or ligand in the assembly, an entitysaveframe is required (one entity for a homodimer assembly, two entitiesfor a heterodimer assembly, for example). Entities can be eitherpolymers or non-polymers. For polymer entities, the sequence is given inboth a one-letter and three-letter format.

More information on required tags and their descriptions is available here:

http://www.bmrb.wisc.edu/dictionary/3.1html_frame/frame_index.html


I'm using exactly this documentation now to implement this.  I have
noticed a problem though.  The file:

http://www.bmrb.wisc.edu/dictionary/3.1html/TablePage.html

You may not be using the 'frames' version of the dictionary. Thisversion may provide more detailed information for each tag.


does not always fully load when you ask for it.  Is this being
automatically generated?


We will need to look into this problem. It seems to work for me here.

I can try to put together more examples and send them to you this weekend.

Also, for submission, should relax produce the full set of records for
this data, so for example all of:

The required tags are annotated below.

In the header part of each saveframe there is a tag something like"_Heteronuclear_T1_list.ID". This tag is required and its values are acounter for each instance of a saveframe of that category. Your examplefile that you attached to this e-mail loaded into the ADIT-NMRdeposition system, but only one saveframe was created for each category.I think this is because the '_xxx.ID' tag was missing from thesaveframe header.

   loop_
       _T1.ID          should be provided as a counter
       _T1.Assembly_atom_ID    not needed
       _T1.Entity_assembly_ID  not abosolutely required always has a value
of 1
       _T1.Entity_ID           required particularly for molecular
assemblies that have more than one entity (heterodimers, complexes, etc.)
       _T1.Comp_index_ID       required, this is the residue sequence
number
       _T1.Seq_ID              not needed
       _T1.Comp_ID             required, residue type
       _T1.Atom_ID             required, this is the atom name and allows
for things like backbone nitrogen and sidechain nitrogen data to be listed
in the same table.
       _T1.Atom_type           not absolutely required but useful to
distinguish CA from calcium.
       _T1.Atom_isotope_number not required, but useful to distinguish 1H
from 2H relaxation
       _T1.Val                 required
       _T1.Val_err                     required in general
       _T1.Resonance_ID                not needed
       _T1.Auth_entity_assembly_ID     not needed
       _T1.Auth_seq_ID         not needed
       _T1.Auth_comp_ID                not needed
       _T1.Auth_atom_ID                not needed
       _T1.Entry_ID                    not needed
       _T1.Heteronucl_T1_list_ID       required, particularly for entries
with more than one T1/R1 table


This will be very useful.  Is this in the individual tag descriptions?
 I cannot find this information in the online documentation.

This information, in a more formal presentation, is available in the'frames' version of the NMR-STAR v3 dictionary on the web site:


http://www.bmrb.wisc.edu/dictionary/3.1html_frame/frame_index.html

If you select an individual tag, the frame on the right contains a field"Nulls allowed in database?". The value will be 'no' or 'yes'.

for R1 data?  Or can a subset of these be filled out for submission?

Only the required tags above are needed. The others can be left out

And for missing entries, should this be represented as the character
'.' or '?', as both seem to be used in the file bmr15437_3.str.gz

The use of a '?' is clearer to read. Either is allowed. By definition, ','
means null and '?' means 'should have a value, but I do not have a value to
enter'.


Ok, I'll use these trying to match the bmr15437_3.str.gz file attached
to https://gna.org/task/?6438.

Cheers,

Edward

Re: BMRB heteronuclear relaxation records together with multiple spin types.

Header

Content

Related Messages