1. Hitachi Vantara - Pentaho Documentation
"Database lookup" step:
Reference for B: The documentation describes the "Load all data from table" option for the cache. This action inherently requires memory. The "Pentaho Performance Tuning" guide frequently discusses allocating sufficient heap space (-Xmx) for transformations that cache large datasets. Section: Cache.
Reference for E: The documentation explicitly states
"If you expect more than one row to be returned
the returned value will be the first one encountered." This confirms that only one matching row is used. Section: Options.
Reference for A (Incorrectness): The presence of fields to specify default values for lookup fields when a match is not found confirms that rows are not dropped
but rather are passed on with null/default values. Section: The fields to return.
2. Pentaho Data Integration Official Documentation (General Principles):
Reference for C: The underlying engine for PDI is Java. In-memory lookups using standard Java data structures like HashMap rely on the equals() and hashCode() methods of key objects. For java.lang.String
these methods are case-sensitive. This is a fundamental principle of the platform's operation when performing in-memory comparisons. This behavior is detailed in general Java and PDI developer guides.