RDB Archive Engine Service

The RDB archive engine reads samples from PVs and writes them to an RDB. The RDB may be MySQL, PostgreSQL or Oracle. For smaller setups and to get started, MySQL is very straight forward and will be described in here. For a production setup, PostgreSQL or Oracle can use partitioned table space that allows better data management over time. See https://github.com/ControlSystemStudio/phoebus/tree/master/app/databrowser-timescale for more on using PostgreSQL with the TimescaleDB extension to partition data and get optimized data retrieval.

Once the RDB is configured with the archive table schema, the archive engine is used both as a command line tool to configure the archive settings and as a service to write samples from PVs to the RDB. You can build the archive engine from sources or fetch a binary from https://controlssoftware.sns.ornl.gov/css_phoebus

Below are examples using either MySQL or PostgreSQL.

Install MySQL (Centos Example)

Install:

sudo yum install mariadb-server

Start:

sudo systemctl start mariadb

Set root password, which is initially empty:

/usr/bin/mysql_secure_installation

In the following we assume you set the root password to $root. To start RDB when computer boots:

sudo systemctl enable mariadb.service

Install PostgreSQL (RHEL 9.6)

Install:

sudo yum install postgresql-server

Key setup steps from /usr/share/doc/postgresql/README.rpm-dist:

sudo postgresql-setup --initdb
sudo systemctl start postgresql.service
sudo systemctl enable postgresql.service

By default, access to the database is extremely limited. In later examples, we want to allow a user archive to write data and a user report to read. An easy way to allow this from localhost is the following:

sudo su - postgres

# Add this to /var/lib/pgsql/data/pg_hba.conf
host all  all     127.0.0.1/32      password

pg_ctl reload

Note that the password type of access is not secure. Study the PostgreSQL documentation for secure alternatives that meet your requirements.

Create archive tables

For MySQL, connect to mysql as root:

mysql -u root -p'$root'

and then paste the commands shown in services/archive-engine/dbd/MySQL.dbd (available online as https://github.com/ControlSystemStudio/phoebus/blob/master/services/archive-engine/dbd/MySQL.dbd ) to create the table setup for archiving PV samples.

The provided database schema is meant as an example, concentrating on the essential tables. It uses a single large sample table. A production setup might prefer to partition the table by for example creating a new partition each month.

The schema as provided does not rely on table constraints. For example, while the chan_grp.eng_id should refer to a valid smpl_eng.eng_id, there may not be a foreign key constraint to enforce this. This has been done to minimize RDB overhead, using the database simply as “storage” and enforcing the correctness of the data inside the archive engine when it is importing a configuration or adding samples. For a production setup, you may want to add or remove constraints as desired.

For PostgreSQL, connect to the database as the postgres super user:

sudo su - postgres -c psql

and create the archive tables as shown in https://github.com/ControlSystemStudio/phoebus/blob/master/services/archive-engine/dbd/postgres_schema.txt

You should not simply copy/paste the whole file into the psql shell. For example, commands for creating accounts and setting permissions are shown in the file as comments, where some of them need to be executed _before_ and others _after_ creating all the archive tables. You need to decide if you want to paste those commands as shown in the comment, or change for example the passwords to your liking.

To test access, try this from any user account on the same host:

psql -h 127.0.0.1 -U archive -W archive
Password: $archive
SELECT * FROM channel;
\q

psql -h 127.0.0.1 -U report -W archive
Password: $report
SELECT * FROM channel;
\q

Example for python access:

pip install psycopg

python3

import psycopg
with psycopg.connect("host=127.0.0.1 user=report password=$report dbname=archive") as conn:
    with conn.cursor() as cur:
        cur.execute("SELECT channel_id, name FROM channel")
        for row in cur:
            print("#%5d - %s" % (row[0], row[1]))

View Archive Data

The default settings for the Phoebus Data Browser check for archived data in mysql://localhost/archive. To access MySQL on another host, change these settings in your Preferences Listing

org.csstudio.trends.databrowser3/urls=jdbc:mysql://my.host.site.org/archive|RDB
org.csstudio.trends.databrowser3/archives=jdbc:mysql://my.host.site.org/archive|RDB
org.phoebus.archive.reader.rdb/user=report
org.phoebus.archive.reader.rdb/password=$report

The MySQL.dbd used to install the archive tables adds a few demo samples for sim://sine(0, 10, 50, 0.1) around 2004-01-10 13:01, so you can simply add that channel to a Data Browser and find data at that time.

For PostgreSQL, change the URLs to jdbc:postgresql://my.host.site.org:5432/archive

In case of connection problems, you may want to start with my.host.site.org replaced by 127.0.0.1 and running on the database host. Use the MySQL or PostgreSQL command line tools to test connections with the same host, port, user and password.

List, Export and Import Configurations

List configurations:

archive-engine.sh -list
Archive Engine Configurations:
ID  Name     Description        URL
 1  Demo     Demo Engine        http://localhost:4812

Extract configuration into an XML file:

archive-engine.sh -engine Demo -export Demo.xml

For a description of the XML schema, see archive_config.xsd.

Modify the XML file or create a new one to list the channels you want to archive and to configure how they should be samples. For details on the ‘scanned’ and ‘monitored’ sample modes, refer to the CS-Studio manual chapter http://cs-studio.sourceforge.net/docbook/ch11.html

Finally, import the XML configuration into the RDB, in this example replacing the original one:

archive-engine.sh -engine Demo -import Demo.xml -port 4812 -replace_engine

PV Name Details

The archive engine uses CS-Studio PV names. “ca://xxxx” will force a Channel Access connection, “pva://xxxx” will force a PV Access connection, and just “xxxx” will use the default PV type configurable via

org.phoebus.pv/default=ca

Since EPICS 7, IOCs can support both protocols. “xxxx”, “ca://xxxx” and “pva://xxxx” will thus refer to the same record on the IOC.

The preference setting

org.csstudio.archive/equivalent_pv_prefixes=ca, pva

causes the archive engine to treat them equivalent as well. For details, refer to the description of the equivalent_pv_prefixes preference setting.

Run the Archive Engine

To start the archive engine for a configuration:

archive-engine.sh -engine Demo -port 4812 -settings my_settings.ini

The engine name (‘Demo’) needs to match a previously imported configuration name, and the port number (4812) needs to match the port number used when importing the configuration.

The settings (my_settings.ini) typically contain the EPICS CA address list settings as well as archive engine configuration details, see archive engine settings in Preferences Listing.

In a production setup, the archive engine is best run under procServ (https://github.com/ralphlange/procServ).

The running archive engine offers a simple shell:

INFO Archive Configuration 'Demo'
...
INFO Web Server : http://localhost:4812
...
>
> help
Archive Engine Commands:
help            -  Show commands
disconnected    -  Show disconnected channels
restart         -  Restart archive engine
shutdown        -  Stop the archive engine

In addition, it has a web interface accessible under the URL shown at startup for inspecting connection state, last archived value for each channel and more. The engine can be shut down via either the shutdown command entered on the shell, or by accessing the stop URL. For the URL shown in the startup above that would be http://localhost:4812/stop.