"mysql-test/suite/rpl/t/rpl_insert.test" did not exist on "53def7bda7a5f7cc29ebb99ae083760d616b518d"
Commit f54e3fa6 authored by Kirill Smelkov's avatar Kirill Smelkov

BigFile: .data can be BTreeData or None or (possibly non-empty) str

Current BigFile code in many places assumes .data property is either
None or BTreeData instance. But as already shown in 4d8f0c33 (BigFile:
Fix non-range download of just created big file) this is not true and
.data can be an str.

This leads to situations where code wants to act on an str, like on a
BTreeData instance, e.g.

    def _range_request_handler():
        ...

        if data is not None:
            RESPONSE.setHeader('Last-Modified', rfc1123_date(data._p_mtime))

or

    def _read_data(... data=None):
        ...

        if data is None:
            btree = BTreeData()
        else:
            btree = data

        ...
        btree.write(...)

and other places, and in all those situation we'll get AttributeError
(str has neither ._p_mtime nor .write) and it will crash.

~~~~

.data can be str at least because '' is the default value for `data`
field in Data property sheet. From this proposition the code could be
reorganised to work in "data is either BTreeData or empty (= None or '')"

But we discussed with Romain and his idea is that non empty strings have
to be too supported because of compatibility reasons and because of
desire to support possible future automatic File-based documents
migration to BigFiles.

From this perspective for BigFile the invariant thus

    .data is either BTreeData or str (empty or not) or None.

This patch goes through whole BigFile code and corrects places to either
properly support str case, or None (in e.g. computing len(data) in
index_html).

In _read_data() if data is previously str - that means we are appending
content to this file and thus it is a good idea to first convert str
(empty or not) to BTreeData and then proceed with appending.
Helped-by: Vincent Pelletier's avatarVincent Pelletier <vincent@nexedi.com>
Reviewed-by: Romain Courteaud's avatarRomain Courteaud <romain@nexedi.com>
parent 28225e1b
...@@ -32,6 +32,34 @@ class BigFile(File): ...@@ -32,6 +32,34 @@ class BigFile(File):
""" """
Support storing huge file. Support storing huge file.
No convertion is allowed for now. No convertion is allowed for now.
NOTE BigFile maintains the following invariant:
data property is either
- BTreeData instance, or
- str(*), or
- None.
(*) str has to be supported because '' is a default value for `data` field
from Data property sheet.
Even more - for
a) compatibility reasons, and
b) desire to support automatic migration of File-based documents
from document_module to BigFiles
non-empty str for data also have to be supported.
XXX(kirr) I'm not sure supporting non-empty str is a good idea (it
would be simpler if .data could be either BTreeData or "empty"),
but neither I'm experienced enough in erp5 nor know what are
appropriate compatibility requirements.
We discussed with Romain and settled on "None or str or BTreeData"
invariant for now.
""" """
meta_type = 'ERP5 Big File' meta_type = 'ERP5 Big File'
...@@ -98,6 +126,11 @@ class BigFile(File): ...@@ -98,6 +126,11 @@ class BigFile(File):
if data is None: if data is None:
btree = BTreeData() btree = BTreeData()
elif isinstance(data, str):
# we'll want to append content to this file -
# - automatically convert str (empty or not) to BTreeData
btree = BTreeData()
btree.write(data, 0)
else: else:
btree = data btree = data
seek(0) seek(0)
...@@ -118,13 +151,10 @@ class BigFile(File): ...@@ -118,13 +151,10 @@ class BigFile(File):
def _data_mtime(self): def _data_mtime(self):
"""get .data mtime if present and fallback to self._p_mtime""" """get .data mtime if present and fallback to self._p_mtime"""
# there is no data._p_mtime when data is None. # there is no data._p_mtime when data is None or str.
# so try and fallback to self._p_mtime # so try and fallback to self._p_mtime
data = self._baseGetData() data = self._baseGetData()
if data is not None: mtime = getattr(data, '_p_mtime', self._p_mtime)
mtime = data._p_mtime
else:
mtime = self._p_mtime
return mtime return mtime
def _range_request_handler(self, REQUEST, RESPONSE): def _range_request_handler(self, REQUEST, RESPONSE):
...@@ -201,6 +231,7 @@ class BigFile(File): ...@@ -201,6 +231,7 @@ class BigFile(File):
'bytes %d-%d/%d' % (start, end - 1, self.getSize())) 'bytes %d-%d/%d' % (start, end - 1, self.getSize()))
RESPONSE.setStatus(206) # Partial content RESPONSE.setStatus(206) # Partial content
# NOTE data cannot be None here (if it is - ranges are not satisfiable)
if isinstance(data, str): if isinstance(data, str):
RESPONSE.write(data[start:end]) RESPONSE.write(data[start:end])
return True return True
...@@ -243,6 +274,7 @@ class BigFile(File): ...@@ -243,6 +274,7 @@ class BigFile(File):
'Content-Range: bytes %d-%d/%d\r\n\r\n' % ( 'Content-Range: bytes %d-%d/%d\r\n\r\n' % (
start, end - 1, self.getSize())) start, end - 1, self.getSize()))
# NOTE data cannot be None here (if it is - ranges are not satisfiable)
if isinstance(data, str): if isinstance(data, str):
RESPONSE.write(data[start:end]) RESPONSE.write(data[start:end])
...@@ -279,7 +311,7 @@ class BigFile(File): ...@@ -279,7 +311,7 @@ class BigFile(File):
data = self._baseGetData() data = self._baseGetData()
mime = self.getContentType() mime = self.getContentType()
RESPONSE.setHeader('Content-Length', len(data)) RESPONSE.setHeader('Content-Length', data is not None and len(data) or 0)
RESPONSE.setHeader('Content-Type', mime) RESPONSE.setHeader('Content-Type', mime)
if inline is _MARKER: if inline is _MARKER:
# by default, use inline for text and image formats # by default, use inline for text and image formats
......
Markdown is supported
0%
or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment