Force full name reconstruction for taxon tree

pverley · July 17, 2025, 11:51am

Hi there,

Is there a way to force the Taxon.fullName reconstruction for every taxon already present in the tree? (for both isPreferred=true taxa and synonyms)

Thanks !

Grant · July 17, 2025, 6:11pm

Hi @pverley,

Thank you for your question! Currently, there is no way to instruct Specify to rebuild all Taxon full name fields without modifying the ranks in the Taxon tree itself or by editing one of its parent taxa (or grandparent or great-great-grandparent).

Rebuilding full names for taxon nodes that are not preferred is not possible at all at the moment, as our logic explicitly excludes them when the names are rebuilt.

I’ve added a feature request for this capability to our GitHub, including support for a parameter to rebuild synonymized names as well:

github.com/specify/specify7

Add API endpoint to rebuild `fullName` fields in a tree

opened 06:07PM - 17 Jul 25 UTC

grantfitzsimmons

**Problem** Currently, there is no straightforward way to rebuild the `fullName…` fields for our trees (`taxon`, `geography`, `storage`, `tectonicunit`, `geologictimeperiod`, and `lithostrat`). This was most recently requested by Philippe Verley at IRD on the [Speciforum](https://discourse.specifysoftware.org/t/editing-tree-definitions-ranks/1783/5?u=grant), but we have been asked by users for quite some time for a more straightforward approach to rebuilding names. **Describe the solution you'd like** I would like a new API endpoint that allows a user to trigger a rebuild of the `fullName` field for all nodes within a specified tree. This endpoint should be flexible enough to handle all supported tree types in Specify. The endpoint should accept arguments to specify which tree to rebuild. * It must support the following tree types: `taxon`, `geography`, `storage`, `tectonicunit`, `geologictimeperiod`, and `lithostrat`. * The user must provide the specific `id` for the tree that needs rebuilding. * By default, the operation should only rebuild the `fullName` for preferred nodes (`isPreferred` = `true`). * An optional boolean argument, such as `rebuild_synonyms`, should be included. When set to `true`, it would rebuild the `fullName` for all nodes in the tree, including synonyms. We should be able to build an interface for this when #6124 comes along. The API calls might look like this: **Rebuilding preferred names for a taxon tree:** ```http POST /api/specify/trees/taxon/2/rebuild-full-name ```` **Rebuilding all names (including synonyms) for a geography tree:** ```http POST /api/specify/trees/geography/4/rebuild-full-name?rebuild_synonyms=true ``` **Describe alternatives you've considered** The main alternative is the current method of running a SQL script directly on the database or making a modification to the tree definition items and reverting it afterwards. This is not a good long-term solution, and there is no option now to rebuild names for synonyms beyond unsynonymizing them temporarily. **Reported By** IRD (Institut de Recherche pour le Développement) **Additional context** This feature was most recently requested by Philippe Verley at IRD on the [Speciforum](https://discourse.specifysoftware.org/t/editing-tree-definitions-ranks/1783/5?u=grant)

NielsKlazenga · July 18, 2025, 4:49am

Repair tree does that, doesn’t it? If that is not already in the API, it might be better to add that than doing something special for the names.

Grant · July 23, 2025, 8:54pm

Hi @NielsKlazenga,

The “Repair Tree” option only rebuilds the node numbers for the selected tree, not the full names. It might just be the right place to integrate this functionality into the UI!

Technical Details

When you click Repair Tree in the User Tools menu, it runs two functions to renumber the tree and validate that the numbering is correct.

`renumber_tree` function

This function repairs or rebuilds the tree numbering system by:

Updating each node’s rank to match its full name definition in the tree schema
Checking for and warning about invalid parent-child rank relationships
Creating a complete path enumeration for each node in the tree
Assigning new node numbers based on the hierarchical paths
Setting proper highest child node numbers for parent nodes
Clearing any maintenance flags in the system related to tree nodes

`validate_tree_numbering` function

This function checks if the hierarchical tree structure is valid by:

Verifying that all nodes have nodenumber and highestchildnodenumber set
Ensuring children have higher ranks than their parents (maintaining proper hierarchy)
Confirming that child node numbers are properly nested within their parent’s range

Code References

specify7/specifyweb/specify/tree_views.py at f9cb3421767993a8c0ff504f86e94187d79a3dd9 · specify/specify7 · GitHub
specify7/specifyweb/specify/tree_extras.py at f9cb3421767993a8c0ff504f86e94187d79a3dd9 · specify/specify7 · GitHub

pverley · July 23, 2025, 10:23pm

Thank you for the feature request and +1 for adding the feature in the Repair tree: it is actually the first thing I attempted without much thinking, so I guess it is kind of intuitive to expect it here.

The API offers a predict_fullname path and it looked like the perfect opportunity to practice.

Prerequisite: How to use the Specify API as a generic webservice

Disclaimer: the following drill will makes use of PUT request that do alter the database. Even though the API implements optimistic locking which is safer than meddling with the SQL database, I’d say it is advisable to backup the database before running any PUT/POST/DELETE API request.

API predict_fullname

/api/specify_tree/{tree}/{parentid}/predict_fullname/ Returns the predicted fullname for a node based on the name field of the node and its . Requires GET parameters treedefitemid and name, to indicate the rank (treedefitem) and name of the node, respectively.

URL parameters:

{tree} name of the tree. taxon in this case.
{parentid} ID of the Parent of Taxon taxon.parent.id

GET parameters:

{treedefitemid} : Taxonomic rank ID taxon.taxonomicRank.id
{name} : Name of the taxon taxon.name

API taxon

After predicting the taxon fullName, I will need to (i) get the taxon version and (ii) update the taxon fullname. It can be achieved with respectively a GET and a PUT request with api/specify/taxon/{taxonid}/

Requesting synonym taxa

Even though I did not mention it in my initial post, I only need to regenerate full names for infraspecific taxa (subspecies, variety and forma in our case).

I crafted some queries that would give me TaxonID, Taxon name, Taxonomic rank ID, Parent of Taxon ID . For instance:

Taxon ID	Taxon name	Taxonomic Rank ID	Parent of Taxon ID
58906	alata	14	18448
58907	leucostachyus	14	31783
58909	diffusa	14	14263
58921	octandra	14	15064
…	…	14	…

Queries results were exported as CSV files.

API calls

For the sake of clarity and brevity, I assume that connection has been established beforehand.

#! /bin/bash
# path of the request results 
FILE=repair-taxon-fullname_subspecies.csv
# CSFRToken obfuscated here, the one from cookies.txt
csrftoken=*********
while IFS="," read -r taxonid name treedefitemid parentid
do
  echo "parentid: $parentid"
  echo "treedefitemid: $treedefitemid"
  echo "name: $name"
  # generate fullname
  fullname=$(curl -s -b cookies.txt -G "https://specify.herbier-guyane.fr/api/specify_tree/Taxon/${parentid}/predict_fullname/"  --data-urlencode "treedefitemid=${treedefitemid}" --data-urlencode "name=${name}")
  echo "fullname: $fullname"
  # get taxon version
  version=$(curl -s -b cookies.txt -G "https://specify.herbier-guyane.fr/api/specify/taxon/${taxonid}/" | grep -o '\"version\": [0-9]*' | awk '{print $NF}')
  # update fullname
  curl -s -b cookies.txt -X PUT \
    -H "X-CSRFToken: $csrftoken" \
    -H "Referer: https://specify.herbier-guyane.fr/" \
    --data "{\"version\": $version, \"fullname\":\"$fullname\"}" \
    https://specify.herbier-guyane.fr/api/specify/taxon/$taxonid/ \
    | jq '.fullname'
  echo ""
done < <(tail -n +2 $FILE)

The whole script with login and logout:
repair-taxon-fullname.sh (2.2 KB)

Results

I had to generate and update ~3700 taxa. It ran in a few minutes with outputs such as:

parentid: 18448
treedefitemid: 14
name: alata
fullname: Irlbachia alata subsp. alata
"Irlbachia alata subsp. alata"

parentid: 31783
treedefitemid: 14
name: leucostachyus
fullname: Andropogon virginicus subsp. leucostachyus
"Andropogon virginicus subsp. leucostachyus"

etc.

A query on taxon full names with both isPreferred=Yes and isPreferred=No showed afterward that synonym full names had been reconstructed

My personal conclusion is that there is a learning curve to working with the API, but it is worth it a hundred times over for how useful and efficient it is

system · July 30, 2025, 10:23pm

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Can one modify fields to be returned from a tree query (e.g. taxon tree) Get Help	1	37	September 2, 2025
Establishing relationship between synonym and preferred/accepted taxon, en masse Get Help	13	520	May 30, 2024
Trees in Specify :trees_: Trees Specify-7	4	1645	August 15, 2025
Issues searching synonyms in Specify query Get Help	0	85	May 15, 2024
Taxon Tree Definition Full Name Separator not taken into account Get Help	2	73	July 10, 2025