@@ -17,7 +17,7 @@ Within the sorted list of resulting candidate constellations, each will report i
The following stages are proposed for the Identity Reconciliation Engine:
-*Candidate Constellation producing stages.*These stages produce lists of candidate constellations.*
+***Candidate Constellation producing stages***These stages produce lists of candidate constellations.*
* Elastic Search Name Entry (Heading) : *Search the entire name entry from the query constellation against all constructed name entry strings from SNAC identity constellations. Return the top N constellations based on Elastic Search's search algorithm with Elastic Search native scores as the stage score for each candidate constellation.*
* Elastic Search Name (Name-only) : *Search the name portion of the name entry from the query constellation against all constructed name-only strings from SNAC identity constellations. Return the top N constellations based on Elastic Search's search algorithm with Elastic Search native scores as the stage score for each candidate constellation.*
* Elastic Search Surname : *Search the surname component from the query constellation against all surnames from SNAC identity constellations. Return the top N constellations based on Elastic Search's search algorithm with Elastic Search native scores as the stage score for each candidate constellation.*
@@ -25,8 +25,8 @@ The following stages are proposed for the Identity Reconciliation Engine:
* Exist Dates : *Search the exist dates from the query constellation against all identity constellations in SNAC. Return all constellations that contain the exact exist dates as the query constellation.*
* Occupation : *Search the list of occupations from the query constellation against all identity constellations in SNAC. Return all constellations that match all occupations in the query constellation. (The candidates must have the entire list of occupations from the query as a subset of their occupation list.)*
* Place : *Search the list of places from the query constellation against all identity constellations in SNAC. Return all constellations that match all places in the query constellation. (The candidates' list of places must be a superset of the query constellation's places.)*
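The "match all" rule in the Occupation and Place stages amounts to a subset test: a candidate qualifies only when every term in the query also appears in the candidate. A minimal sketch follows, assuming terms are compared as exact strings; the `matches_all` helper, the constellation ids, and the sample data are hypothetical and not part of SNAC.

```python
from typing import Dict, Iterable, List


def matches_all(query_terms: Iterable[str], candidate_terms: Iterable[str]) -> bool:
    """Return True when every query term also appears among the candidate's terms."""
    return set(query_terms) <= set(candidate_terms)


# Hypothetical occupation lists keyed by a made-up constellation id.
candidate_occupations: Dict[str, List[str]] = {
    "c1": ["poet", "editor", "teacher"],
    "c2": ["poet"],
}
query_occupations = ["poet", "editor"]

# Keep only candidates whose list is a superset of the query's list.
matched = [
    cid
    for cid, occupations in candidate_occupations.items()
    if matches_all(query_occupations, occupations)
]
```

Here only `c1` qualifies, because `c2` lacks "editor". Whether terms should be normalized (case, controlled-vocabulary ids) before comparison is left open by the description above.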
-*Candidate Constellation list modifying stages.*These stages take lists of candidate constellations and modify or replace the scores for the input results.*
+***Candidate Constellation list modifying stages***These stages take lists of candidate constellations and modify or replace the scores for the input results.*
* Name Entry Length : *Compute the difference in length between the candidate constellation's constructed name entry string and the query constellation's constructed name entry string. Replace the original stage score with the log of the difference.*
* SNAC Degree Sort : *Replace the original stage score with the number of constellation relations (constellation out-degree) in the candidate constellation. Constellations that are more connected with other SNAC identity constellations will get better scores.*
* SNAC Resource Count Sort : *Replace the original stage score with the number of resource relations (resource out-degree) in the candidate constellation. Constellations with higher resource relations will get better scores.*
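The Name Entry Length score can be sketched as below. The description does not say how a zero difference is handled, and the logarithm of zero is undefined, so this sketch assumes `log(1 + difference)` on the absolute difference; that choice, and the function name `name_entry_length_score`, are assumptions rather than the engine's actual behavior.

```python
import math


def name_entry_length_score(query_name: str, candidate_name: str) -> float:
    """Log-scaled absolute difference in length between two name entry strings."""
    difference = abs(len(candidate_name) - len(query_name))
    # log1p(x) computes log(1 + x), so identical lengths score 0.0 instead of failing.
    return math.log1p(difference)


score = name_entry_length_score("Washington, George", "Washington, George, 1732-1799")
```

Under this assumption a smaller value means the two strings are closer in length, so a consumer of the score would need to treat lower as better for this stage.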
-*Multi-Stage.*This stage allows the execution engine to run multiple stages in sequence, feeding the results of one stage as the input to the next. It results in one final list of candidate constellations from all independent stages it ran.*
+***Multi-Stage***This stage allows the execution engine to run multiple stages in sequence, feeding the results of one stage as the input to the next. It results in one final list of candidate constellations from all independent stages it ran.*
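One way to picture the Multi-Stage behavior is as a fold over a list of stage functions, where each stage receives the previous stage's candidate list. The sketch below is illustrative only: the `Candidate` and `Stage` types, `run_multi_stage`, the two sample stages, and the degree table are all hypothetical stand-ins, not the engine's real interfaces.

```python
from typing import Callable, Dict, List, Optional, Tuple

Candidate = Tuple[str, float]  # (constellation id, stage score)
Stage = Callable[[List[Candidate]], List[Candidate]]


def run_multi_stage(stages: List[Stage], initial: Optional[List[Candidate]] = None) -> List[Candidate]:
    """Run stages in order, passing each stage's output to the next one."""
    results: List[Candidate] = list(initial or [])
    for stage in stages:
        results = stage(results)
    return results


def produce_candidates(_: List[Candidate]) -> List[Candidate]:
    # Stand-in for a producing stage (e.g. a name search); it ignores its input.
    return [("c1", 2.5), ("c2", 1.0)]


# Made-up relation counts used by the stand-in modifying stage.
RELATION_COUNTS: Dict[str, int] = {"c1": 3, "c2": 7}


def degree_sort(candidates: List[Candidate]) -> List[Candidate]:
    # Stand-in for a modifying stage: replace each score with the relation count,
    # then order the list so the most connected candidate comes first.
    rescored = [(cid, float(RELATION_COUNTS.get(cid, 0))) for cid, _ in candidates]
    return sorted(rescored, key=lambda candidate: candidate[1], reverse=True)


final = run_multi_stage([produce_candidates, degree_sort])
```

Treating producing and modifying stages as the same function type keeps the runner trivial; a producing stage simply ignores the list it is handed.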