LoginHomeBugs & FeaturesDocumentationDownloadForums
ID:81
Status:4 - Pending
Priority:1 - Request for Enhancement, e.g. asking for a feature
Scope:3 - Applies to all machines and operating systems
Subsystem:Core API
Summary:No support for internal entities
Version:Future
Created By:scottp
Assigned To:Michael Sweet
Fix Version:Unassigned
Update Notification:

Receive EMails Don't Receive EMails

Trouble Report Files:

Post File

No files

Trouble Report Dialog:

Post Text

Name/Time/DateText
scottp
09:17 Apr 11, 2008
I'm unable to load a document tree that contains internally defined entities. I've included a simple file below that refuses to load for me. Internal (parsed) entities are quite common in the publishing world and are definitely part of the core XML spec ..

http://www.w3.org/TR/2004/REC-xml-20040204/#sec-internal-ent

Thanks!

...scott


----------------------------------------------------
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE topic PUBLIC "-//OASIS//DTD DITA Composite//EN" "ditabase.dtd" [
<!ENTITY myent "TEST">
]>
<topic id = "id0844H000R5Z" >
<title>dita</title>
<body>
<p>dita file <ph>&myent;</ph></p>
</body>
</topic>
---------------------------------------------------
scottp
14:35 Apr 24, 2008
Ah .. I see that it's not the definition of the internal entities that's causing the trouble, it's the actual use of those entities. And I see in the docs that minixml does not support entities other than those required by the XML spec. This is a real problem for me since the documents that I'm parsing may have internal entities or entities defined in a DTD. Is there some way to just have the parser skip over the entities?
Michael Sweet
11:44 Oct 26, 2008
OK, I've reviewed the spec, and the current Mini-XML API cannot be extended to support internal (or external) entities without breaking binary compatibility.

Aside from needing to support multi-character substitution (the current API only supports substitution of single Unicode characters), it would be better to track the entities within a document rather than having a global (to the current thread) scope.

In any case, I don't know whether this will be implemented any time soon (or if it is ever implemented), since Mini-XML isn't designed as a validating XML parser that can import/interpret processing directives.