View on GitHub

linked-amarna

Tools and results from parsing Shlomo Izreel's transcriptions of the Amarna Letters

Amarna Personal Names

Tools and results from parsing Shlomo Izre’el’s transcriptions of the Amarna Letters. Izre’el’s transcriptions can be found here.

Linked Amarna

Process for Data Collection and Curation

Order of Scripts:

  1. name_greper.py. This script downloads the text from Shlomo Izre’el’s website, or uses a text file stored locally. It then prepares and generates the basic table for the database, as well as creates an Excel file that can be imported into OpenRefine. The columns might need to be adjusted in the Excel file to make them easier to work with.

  2. OpenRefine. I ran the cluster key collision, cluster levenshtein, cluster levenshtein with the radius updated to 2, and the cluster levenshetein with block 5. I should have ran the cluster key collision in the name_greper.py script, as the OpenRefine cluster key collision is not as good as can be created in python. The following normalization function is much superior to the one found in OpenRefine.

     import unicodedata 
    
     ...
    
     def remove_diacritics(string):
         for c in unicodedeata.normalize('NFD', string):
             if unicodedata.category(c)[0] != 'M':
                 yield c 
    
     ...
     ''.join(remove_diacritics(SOMESTRING)
    

    After the names have been clustered in OpenRefine, export the file as csv. The current version of the OpenRefine cluster is saved as AmarnaNames4.csv.

  3. name_database_generator.py. This function imports yaml file that I created from looking at the index of Moran’s translation of the Amarna Letters, and Hess’s Amarna personal names. The yaml file is in the data directory. This file also imports the csv file created from OpenRefine.

  4. name_comparer.py collects text variants in the database, and offers suggestions for linking canonical forms of the name to the text line. name_insert.py was used to expand the name information, also added additional information like reference to Hess’s Amarna Personal Names.

  5. d3_data_generator.py. This script generates the data used for the d3 visualization. The JSON file that is output from this script should be adjusted so each of the top level keys become variables, and the file needs to be renamed to graph4.js. This was for the purpose of making the file easier to import into the html page.

The Linked Amarna Visualization

The Visualization is available here.

Using the Visualization

The left hand panel contains a list of all of the names that are found in the database. Click on any of the names to display the sub graph of names connected to that person. The links between names shows that there is at least one text where both names are found. The left panel also has a search box where any form of the name can be searched, all of the transcriptions, and various spellings can be searched.

After a name is selected, a info box will appear in the bottom right hand corner. This info box can be expanded to show more information about the individual.

In the top right corner of the visualization is a toggle switch which will expand any subgraph to include friend of a friend relations. This will allow the user to see how an individuals network fits in the greater corpus.

About the Coding of the Visualization

The resulting data visualization uses a d3 force directed graph. Also fusejs is imported to allow for fuzzy searching of the names.

Bibliography