Deployment Manual: CLC Workbenches
Deployment Manual: CLC Workbenches
CLC Workbenches
Manual for
CLC Workbenches: deployment and technical information, version 1.6
Windows, Mac OS X and Linux
CLC bio
Silkeborgvej 2
Prismet
DK-8000 Aarhus C
Denmark
Contents
1 Introduction 6
1.1 Deployment strategies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.2 System requirements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
3
CONTENTS 4
4 License 21
4.1 License server set-up on clients . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
6 Workflows 24
8 Security policies 26
10 System resources 32
10.1 Setting the amount of memory available for the JVM . . . . . . . . . . . . . . . . 32
10.2 Setting the number of cores to use . . . . . . . . . . . . . . . . . . . . . . . . . . 33
A Appendix 36
A.1 Java Web Start troubleshooting . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
A.1.1 Cannot start Workbench when offline . . . . . . . . . . . . . . . . . . . . . 36
Index 36
Chapter 1
Introduction
If you are in charge of installing and maintaining CLC Workbenches in your organization, you
probably have a lot of questions about installers, licenses, and where do we put files for this and
that.
This manual is written to answer these questions. It is primarily aimed towards client installations
(i.e. the CLC Workbenches). For installing the CLC Bioinformatics Database and CLC Server,
please refer to the installation chapter in the respective user manuals (see http://www.
clcbio.com/usermanuals).
At the moment, the CLC Workbenches are:
In addition, there is the CLC Sequence Viewer which is very similar to the other Workbenches
except when it comes to licensing (it is free) and plug-ins (some plug-ins cannot be installed in
the CLC Sequence Viewer).
In the first part of this manual, we take a closer look at the deployment strategies. Then we
go into details of the license system, followed by an explanation of the concept of plug-ins and
workflows. Finally there is an overview chapter which tells you where to find all the different
files in a client installation. This is useful when deploying the Workbench but also for back-up
purposes.
We will not go into details of Linux installations, but if you need these details, please contact us.
Questions, comments and feedback on this manual are very welcome at [email protected].
6
CHAPTER 1. INTRODUCTION 7
a central configuration, and whenever changes to this configuration is made, the user versions
are automatically updated. The Java Web Start support is described in section 2.
The second strategy requires more work on the administrator side to make sure that local
installations are updated, as there are no special tools provided by CLC bio. However, the
Workbench is very self-contained and simple to configure, so this solution has proven to work in
many cases. Note that this is a brief summary of the steps to take, and that the rest of this
manual is devoted to more detailed information.
1. On a single computer with the same OS as the target computers, complete a full installation
including:
2. Run the installer in silent mode (-q) on all the target computers
3. Copy the following files from the installation in 1) to the target computers
We recommend you tailor this strategy to your own organization - this is just an example of how
it can be done.
Creating a GHOST image or similar to copy to all the target computers is also a possibility, but
there are a few routines performed by the installer that need to be taken into account, especially
allocating memory (see section 10.1) and the creation of shortcuts and file associations.
• Windows XP, Windows Vista, Windows 7, Windows 8, Windows Server 2003 or Windows
Server 2008
• Mac OS X 10.6 or later. However, Mac OS X 10.5.8 is supported on 64-bit Intel systems.
• Linux: Red Hat 5.0 or later. SUSE 10.2 or later. Fedora 6 or later.
• 32 or 64 bit
• Windows XP, Windows Vista, Windows 7, Windows 8, Windows Server 2003 or Windows
Server 2008
• Mac OS X 10.6 or later. However, Mac OS X 10.5.8 is supported on 64-bit Intel systems.
• Linux: Red Hat 5.0 or later. SUSE 10.2 or later. Fedora 6 or later.
System requirements
∗ A graphics card capable of supporting OpenGL 2.0.
∗ Updated graphics drivers. Please make sure the latest driver for the graphics card
is installed.
System Recommendations
∗ A discrete graphics card from either Nvidia or AMD/ATI. Modern integrated graphics
cards (such as the Intel HD Graphics series) may also be used, but these are
usually slower than the discrete cards.
∗ A 64-bit workbench version is recommended for working with large complexes.
• Special requirements for read mapping. The numbers below give minimum and rec-
ommended memory for systems running mapping and analysis tasks. The require-
ments suggested are based on the genome size. Systems with less memory than
specified below will benefit from installing the legacy read mapper plug-in (see http:
//www.clcbio.com/plugins). This is slower than the standard mapper but adjusts to
the amount of memory available.
• Special requirements for de novo assembly. De novo assembly may need more memory
than stated above - this depends both on the number of reads, error profile and the
complexity and size of the genome. See http://www.clcbio.com/white-paper for
examples of the memory usage of various data sets.
• 64 bit computer and operating system required to use more than 2GB RAM
Chapter 2
The Java Web Start system is an easy way to maintain a central configuration of a Workbench
that is deployed to all end-users. CLC bio provides a tool with a web interface for systems
administrators to manage the Workbench configurations. The explanation of the various aspects
of Java Web Start deployment will be coupled with the explanation of how to use this tool in the
rest of this chapter.
Please let your contact person at CLC bio or [email protected] know that you are interested
in using Java Web Start for deploying Workbenches, and you will get access to downloading the
necessary resources through MyCLC: http://my.clcbio.com/.
2.2 System requirements for the Java Web Start Administration Tool
• Windows XP, Windows Vista, or Windows 7, Windows Server 2003 or Windows Server 2008
1
The standard shortcut will have a Java-style icon. To create a proper CLC Workbench icon, go to Help and Create
Shortcut when the Workbench is started.
2
Please refer to the appendix section A.1.1 if there are problems running the Workbench while off line.
10
CHAPTER 2. JAVA WEB START 11
• Mac OS X 10.6 or later. However, Mac OS X 10.5.8 is supported on 64-bit Intel systems.
• 32 or 64-bit
Password admin
The login mechanism of the tool is very simple. The user name and password are stored in a file
called tomcat-users.xml in the conf folder of the installation directory. Find the following
line to change this:
Figure 2.1: An overview of the web interface of the Java Web Start Administration Tool.
Once the .cwd file is downloaded, the file has to be uploaded through the administrator web
interface:
Manage Workbench Distributions | Manual install workbench distribution | Select
the .cwd/war file | Upload
Once the file is uploaded, switch to the Installed workbench distributions tab to see an overview
of the different Workbench versions installed (see figure 2.2.
Figure 2.2: One version of CLC Genomics Workbench and one version of CLC Main Workbench
are installed.
Clone existing profile This can be used if the new profile should be based on an existing profile.
Profile key This will be used to construct the link to the JNLP file that users need to launch the
Workbench.
Max memory This is the memory setting for starting the Workbench (learn more in section 10.1).
JVM parameters This is an advanced feature that should only be used if special instructions are
received from CLC bio Support or someone with expertise in Java Virtual Machines.
The link to the JNLP file for each profile can be found in this list. In addition, by clicking the
profile, it can be edited, deleted or exported. An exported profile can be imported again using the
tab next to Create Profile.
2.5.2 Plug-ins
Each profile can have a number of plug-ins pre-installed. End-users can also install plug-ins
unless this has been disallowed in the policy properties file (see chapter 8).
The plug-ins are managed for each profile under the Plug-ins section in the menu on the left hand
side. An overview of all plug-ins that are currently installed is shown, as well as the option to
download or manually install plug-ins.
When switching to Download plug-ins, a list of all plug-ins distributed by CLC bio is shown (see
figure 2.5)
Click a plug-in to read more. Use the checkboxes to select multiple plug-ins for convenient
download and installation. For plug-ins distributed by CLC bio that you install, when a new release
of the plug-in is made available, it will show up in the list of installed plug-ins with an option to
update it.
Property files
The web interface allows you to manipulate property files directly. When a new profile is created,
two property files are created per default (license information and policies), but others can be
created as well. For a an overview of all the properties that can be managed in this way, look at
section 11.2.
The property files can be edited directly in the web interface as shown for the license property
file in figure 2.6, or they can be uploaded.
Codon frequencies
In the Workbench installation folder under res, there is a folder named codonfreq. This folder
contains all the codon frequency tables organized into subfolders in a hierarchy. These are
available in the Workbench user interface when performing reverse translation.
If additional frequency tables are needed, they can be added to the profile by selecting the Codon
frequencies and clicking browse and Upload to upload a new codon frequency file.
The format needs to follow the format of the cftbl files located in the Workbench installation
folder, so please use one of these as template.
Note! Please be aware that this process needs to be handled carefully, since there are no
validation of the format of the files provided.
CHAPTER 2. JAVA WEB START 16
url = http://localhost:9090/WebStartAdministration122/bugreport/
Whenever a bug submission is received, an email can be sent from the Java Web Start
Administration Tool if all the email settings are filled in. Multiple receivers can be specified by
separating them with commas. If the string properties-username is entered in any of the
receiver or sender address fields, it will be replaced with the operating system username from
the bug submission.
By default each bug submission is saved to the system tmp folder. Below the mail settings
CHAPTER 2. JAVA WEB START 17
fields, you can enter a path to a Backup directory which will then be an alternate place to save
each bug submission. This should be an absolute path.
When the workbench starts up it is written in the log where the bug submission is directed to.
There is a test email config button and test backup directory button at the very bottom of the
page.
Chapter 3
This chapter deals with the installer and related information about the installation process when
using the standard installers rather than Java Web Start.
3.1.1 Java
The Workbenches are based on Java, and this means that there has to be a Java Runtime
Environment (JRE) on the computer to run the Workbench. For both Linux and Windows, the
installers have a built-in JRE that will be installed in the installation directory of the Workbench.
The advantage of this is twofold:
1. For computers who do not already have a JRE installed, the need for downloading and
installing a JRE is eliminated.
2. For computers who already have a JRE installed, there will never be compatibility problems
because the Workbench always uses its own JRE.
The built-in JRE is the latest Java 6 JRE from Sun Microsystems (http://java.sun.com).
The JRE used for running the CLC Workbench will not interfere with existing JREs on the computer.
Java on Mac
Since the Workbench uses Apple's JRE, there is no JRE included in the installer. When running
the Workbench on 64-bit systems, please make sure that the 64-bit Java is used for launching
applications:
Go to /Applications/Utilties/Java and double click on Java Preferences. In the Java
application versions, reorder the list to have JRE6/64 bit at the top. Note that this may change
the behavior of other Java-based programs on the computer.
18
CHAPTER 3. INSTALLATION WITHOUT JAVA WEB START 19
Note! After setting the Java preference, you can either uninstall and reinstall the workbench
and the memory will be adjusted automatically, or you can set the memory yourself (see section
10.1).
Note that each major version of a Workbench has its own installation directory. This means that
when upgrading from e.g. CLC Main Workbench 5 to CLC Main Workbench 6, the old installation
directory of version 5 will be left untouched when you install CLC Main Workbench 6.
If you wish to remove the old installation, please run the Uninstall program.
Minor updates will use the existing installation directory of the Workbench.
The installation directory can be defined during installation - the above are the default installation
directories (see section 3.3 for more information on how to define the installation directory).
License
Fixed license A license order ID has to be activated against our server for each computer. The
license will then be fixed to this computer. This requires manual intervention for each
activation.
Floating license A license server is installed in your organization. It hosts a number of licenses
which can be shared among all computers. Note that the license server is available for
both Linux, Windows and Mac OS X.
For large installations, the floating license is by far the best option, since all the license
administration takes place on the server (find the manual for the license server together with the
server distribution). The fixed license requires manual work during installation and also if the
licenses need to be updated.
For information on how to use the floating license, please refer to the user manual for the relevant
Workbench (see http://www.clcbio.com/manuals).
Plug-ins use the same licensing system as the Workbenches, so all the concepts described here
also apply to the plug-in licenses.
21
CHAPTER 4. LICENSE 22
serverip=
serverport=6200
disableborrow=false
autodiscover=true
useserver=true
Since all this information is stored in a file in the installation directory, it can easily be copied
to all clients, and the license configuration is completed. When the Workbench is started, it will
look in this file, and if useserver=true then it will try to connect to the license server, and no
license dialogs will be shown to the user.
You can download a sample license.properties file at http://clcbio.com/files/
deployment/license.properties.
Chapter 5
There is a graphical user interface to install plug-ins called the Plug-in Manager ( ) which is
invoked in the Help menu (see figure 5.1).
Plug-ins are either general modules or extensions provided by CLC bio (see http://www.
clcbio.com/plugins) or can be custom-made plug-ins specific to your organization.
Plug-ins can either be downloaded and installed directly in the Plug-in Manager, or they can be
installed from a file Install from File button at the bottom of the Plug-in Manager.
Resources are installed in the same way as plug-ins. Resources can be e.g. PFAM databases
used by the Workbench's PFAM Domain Search ( ).
Installing a plug-in is basically just a matter of putting files in the right folder. All plug-in files are
put in plugins and all resources in resource in the installation directory. This means that
the contents of these folders can be copied to other computers, and they will have the plug-ins
installed.
Licenses for the plug-ins are handled the same way as the Workbench licenses, see section 4.
23
Chapter 6
Workflows
There is a graphical user interface to install workflows called Workflows ( ) which is invoked in
the Help menu (see figure 6.1).
Workflows can be created in any workbench and distributed as an installer file that can be
installed in any workbench or server.
A workflow is always installed per user. The workflow definition is stored in the user home (see
section 11.3).
The ability to install workflows can be disabled by the policy (see section 8).
When the workbench is part of a CLC Genomics Server set-up, it will be a great advantage to
manage workflows to be used by all users on the server which means no local deployment
when workflows are updated (see more in the user manual at http://www.clcbio.com/
usermanuals).
24
Chapter 7
Information about server name and port can be stored in a file called serverinfo.properties
in the settings folder in the Workbench installation directory. When the user opens the log-in
dialog, the Workbench will read in the information from this file.
If the file does not exist, the information that the user enters will be saved in the user settings.
The user name and password is stored with the user settings.
The serverinfo.properties file contains the following:
port=7777
host=hostname
Since all this information is stored in a file in the installation directory, it can easily be copied to
all clients.
You can download a sample serverinfo.properties file at http://clcbio.com/files/
deployment/serverinfo.properties.
25
Chapter 8
Security policies
The Workbench has a security policy configuration that enables administrators to restrict users'
access to:
• Tools accessing services on the internet. This includes NCBI BLAST, NCBI and Uniprot
Searches.
• Notifications about updates. Update notifications on new Workbench and plug-in versions
• Plug-in management. Installation of plug-ins.
26
CHAPTER 8. SECURITY POLICIES 27
plugin_manage Controls whether the Plug-ins and Resources manager should be available for
the user. Note that users can still install plug-in updates if plugin_download and
plugin_version_check are allowed.
plugin_file_install Controls whether this user should be allowed to install plug-ins and resources
from a local file.
plugin_download Controls whether this user should be allowed to install CLC plug-ins and
resources downloaded directly within the plug-in manager dialog. This also includes
manually checking for updated plug-ins in the Plug-ins and Resources manager dialog and
also the automatic check for plug-in updates at start-up.
workflow_file_install Controls whether the user should be allowed to install workflows from a
file.
workflow_download Controls whether the user should be allowed to download and install
workflows form the CLC workflows repository.
This chapter explains how data is stored, gives general guidance on size of data, and outlines
configurations needed for running analyses on large amounts of data.
Figure 9.1: In this example the location called 'CLC_Data' points to the folder at C:\Documents and
settings\clcuser\CLC_Data.
28
CHAPTER 9. STORING AND BACKING UP DATA 29
Figure 9.2: Mousing over the location called 'CLC_Data' shows the full path to the system folder,
which in this case is C:\Users\boester\CLC_Data.
If the Workbench is connected to a CLC Server, the server's locations will automatically show up
when the user is logged in. This chapter does not deal with server locations - please refer to the
server user manual at http://www.clcbio.com/usermanuals.
The list of locations is stored in a file called model_settings_300.xml in the settings
folder in the user home (see section 11.3). We do not recommend manual editing of this file,
although it is standard xml.
datadir = c:\clcdata
The following variables can be used to construct the path to the desired location:
A few examples:
Note that the folder does not need to exist - it will be created if needed. You can use both slash
(normally used on Mac and Linux) and backslash (normally used on Windows) in the configuration
file - the Workbench will use the appropriate one depending on the platform.
Note that the default location is only considered the very first time the Workbench starts. When
the Workbench closes the first time, the model_settings_300.xml file is created and this is
where it will look for the locations further on. Deleting this file will make the Workbench look for
CHAPTER 9. STORING AND BACKING UP DATA 30
tmpdir = /path-to-temp
Instead of "/path-to-temp" you write the absolute path to the new tmp directory. When the
Workbench is restarted, it will then use the new directory for storing temporary data.
You can download a sample path.properties file at http://clcbio.com/files/deployment/
path.properties.
Note! It is imperative for acceptable performance that data transfer to the temp directory is not
over a network connection. Since the Workbench will spend a lot of time writing and reading
these files, disk speed has a great impact on overall performance when working with large data
sets.
CHAPTER 9. STORING AND BACKING UP DATA 31
• Reads are imported and take up space as raw reads (see details below). Once imported,
you can delete the original sequence file if you do not need it for other purposes.
• When the data has been assembled, either de-novo or against a reference, they take up
space once again (this time more space since there is also information about where they
map etc).
• The computer doing the analysis needs space for tmp files. Once the assembly is done,
the temporary files are deleted. The temporary files usually do not take up more space than
the final result
System resources
You can specify the amount of system resources that the Workbench is allowed to use. This can
be done for both memory and CPU.
-Xmx1400m
The number (1400) is the amount of memory the Workbench is allowed to use. The file is placed in
the installation folder (e.g. C:\Program Files\CLC Main Workbench 6\clcmainwb.vmoptions).
On Mac OS X, the -Xmx value is stored in Info.plist in the application bundle (Control-click
the application and choose "Show Package Contents").
The value is set to 50% of the computers RAM per default, and at a maximum of 1400 MB for
32-bit systems and a maximum at 50GB for 64-bit systems (at this point there is no performance
increase by setting it higher because of overheads in memory management). If you do not wish
to use the installer on each computer and use an image instead, either make sure all computers
have the same amount of RAM, or set the number to 50% of the computer with the smallest
amount of RAM (this value should not be lower than 200 MB, and for genomics-scale data, it
should be significantly higher).
You can manually change the contents if you want to raise or lower this value. In case you get
an out of memory error you may need to allocate a higher amount of memory by increasing the
number in the .vmoptions file. In case you get a java exception saying "The JVM could not be
started. The main method may have thrown an exception" you may need to decrease the value
in the .vmoptions file.
32
CHAPTER 10. SYSTEM RESOURCES 33
maxcores = 1
Instead of 1 you write the maximum number of cores that the Workbench is allowed to use.
When the Workbench is restarted, it will comply with this setting. Please note that this is not a
guarantee that the Workbench will never use more, but that will be for very brief and infrequent
peaks and should not affect performance of other applications running on your system. The only
exception is when a user starts several jobs to run simultaneously.
You can download a sample cpu.properties file at http://clcbio.com/files/deployment/
cpu.properties.
Chapter 11
This part gives you an overview of all the parts of a CLC Workbench installation. Some parts are
at the level of the computer, i.e. shared by all users, whereas other parts are at the user level.
Licenses The license information depends on what kind of license you use:
Fixed licenses Stored in the licenses folder (they are unique for each computer)
Floating licenses Information about the license server connection is stored in the license.properties
file in the settings folder.
Plug-ins are stored in the plug-ins folder and can be copied to other computers running the
same version of the Workbench.
Resources are stored in the resource folder and can be copied to other computers running the
same version of the Workbench.
Various property files are stored in the settings folder. See a full overview in section 11.2
In addition, file associations for .clc files are stored in the registry database on Windows.
policy.properties for specifying if certain options should be removed from the Workbench (see
section 8).
path.properties for specifying default data location (see section 9.1.2) and where to store
temporary data (see section 9.3.1)
34
CHAPTER 11. OVERVIEW - WHERE DO WE PUT THINGS? 35
serverinfo.properties specify host and port for connecting to a CLC server (see section 7).
proxy.properties holds information about proxy server (when the Workbench needs access to
online services). Read more in the Workbench user manual at http://clcsupport.com/
clcgenomicsworkbench/602/index.php?manual=Network_configuration.html
license.properties holds information about the host and port of a license server that should be
used (see section 4.1).
cpu.properties specifies the maximum number of cores to be used by the Workbench (see
section 10.2).
bugsubmission.properties specifies the server to receive bug reports. You need to use the Java
Web Start Administration Tool to be able to use this feature (see section 2.6).
Linux $HOME/.clcbio
User settings The user settings file stores information such as view settings, parameters,
workspaces, user name and password to server log-ins and other settings customized
by the user. The user settings file is found in the application data folder under
settings/workbench name/version name. It is recommended to back up the
user settings file. When upgrading to a new version of the Workbench, the user's old user
settings file is copied by the Workbench the first time it is run.
Locations Information about which locations the user has added in the Navigation Area is stored
in the model_settings_300.xml file in the settings folder.
Error logs to be sent to [email protected] for use in case of program errors are output.log
and error.log and they are found in the log folder.
Appendix
• Launch Javaws explicitly from a command line using javaws -offline <jnlp_url>
• Launch the cached application via Java Cache Viewer, see http://docs.oracle.com/
javase/tutorial/deployment/webstart/running.html#cache
36
Index
37