Proteax for Spreadsheets

Proteax for Spreadsheets is a spreadsheet plugin that gives you protein chemistry functions within Microsoft Excel® or OpenOffice.org® Calc. You can analyze, change, and compare protein entries by using the spreadsheet functions that the Proteax add-in provides.

Proteax for Spreadsheets can visualize protein sequences and chemistry details in modified proteins using the built-in graphical live preview. Mass spectrometry work can be eased via the direct interface to Lighthouse Data's GPMAW application for protein editing and display.

Protein chemistry as normal spreadsheet functions

Proteax for Spreadsheets functions let you analyze the sequence and chemistry properties of protein entries. You can browse the available functions and interact directly with them by using the function wizards of Excel and Calc.

[Image: Proteax for Spreadsheets in Excel 2007.]

Chemistry formulas in use in Excel 2007.

[Image: OpenOffice.org Calc's function wizard supports interactive testing of Proteax functions.]

Interactive testing of functions in OpenOffice.org Calc.

Compare protein sequence and structure

Sequence and structural identity checks allow you to easily find duplicates. Proteax for Spreadsheets is able to normalize protein entries to ensure that the comparisons are not confused by the order of chains and internal rotation of cyclic chains.

[Image: Robust sequence comparisons.]

Normalization of sequences reveal identical entries that at first glance look different. Corresponding functions normalize the entry taking post-translational and chemical modifications into account.

DerNot expressions describe the edit steps necessary for producing one protein entry from another; e.g. creating a protein derivative from a known reference protein.

Proteax for Spreadsheets contain functions that can both calculate the DerNot expression needed for creating a protein derivative, and also apply a DerNot expression to a reference protein to create a derivative.

[Image: Calculating DerNot expressions.]

DerNot expressions can be used as trivial names for protein derivatives.

Reformat entries with ease

A variety of industry standard formats are used to represent protein entries. Proteax for Spreadsheets will automatically recognize entries in FASTA, UniProt, GPMAW, and PLN format whether you import them from file or type them in a spreadsheet cell. Proteax functions can be used to reformat entries inside your spreadsheet or you can export entries in the format of your choice.

[Image: Proteax FASTA reformatting example.]

Protein entries may come in a variety of formats. The mixed entries in the A column have been reformatted in column B to FASTA format using the spreadsheet function PROTEAX_AS_FASTA().

Protein visualization

Proteax for Spreadsheets has a built-in Live Preview tool for visualization of protein entries. The Live Preview tool will track the currently selected spreadsheet cell so the visualization is always in sync with the spreadsheet.

[Image: Viewing 2D chemical structure in Excel 2007.]

Integrated Live Preview lets you visualize the currently selected cell as both a protein sequence and its 2D chemical structure. Above, the amidation of GLP-1 is clearly visible when zooming in on the chemical details of the C-terminal (NB: Implicit hydrogens are not displayed in the preview but are accounted for in calculations).

For publication-level chemical visualization Proteax for Spreadsheets lets you use external applications. Below ChemAxon's MarvinSketch has been used as chemical struture renderer.

[Image: Viewing 2D chemical structure in Excel 2007.]

Protein entries can be viewed in full chemical detail in external applications.

Mass-spec analysis - GPMAW integration

Users that have Lighthouse Data's GPMAW installed can edit protein entries in GPMAW. Changes are transferred directly back into your spreadsheet at the moment you press the "Save" button in GPMAW.

[Image: Protein editing spawns external GPMAW editor.]

Editing a protein entry starts the external GPMAW editor.

[Image: Changes are transferred directly back to the spreadsheet.]

Saving changes in GPMAW triggers a transfer of the changed entry directly back into the spreadsheet.

Chemical sum formula utilities

Proteax for Spreadsheets includes functions that allow you to easily manipulate chemical sum formulas, e.g. add and subtract sum formulas and extract certain element abundances (number of nitrogens, number of oxygens...). Below you can see an example of these functions used in OpenOffice.org Calc.

[Image: Proteax formula function shown in OOo Calc.]

System requirements and availability

Proteax for Spreadsheets requires

  • 32-bit Windows® 2000, XP, Vista, Windows 7 (*) (**)
  • Microsoft Excel 2003, 2007 or OpenOffice.org Calc 3.1

You can install Proteax for Spreadsheets from a normal user account - you do not need administrator privileges. A fully functional demo version is available from the download area.


(*) Also runs on 64-bit Windows 7.

(**) Proteax for Spreadsheets also runs on Linux® as shown below but there is no official installer for it yet. Contact for more information and to discuss future availability on Linux.

[Image: Proteax for Spreadsheets on Ubuntu 9.04.]

Development version of Proteax for Spreadsheets 1.1 running on Ubuntu 9.04.