TEXUS: A task-based approach for table extraction and understanding