- [Databricks] DELTA_DELETION_VECTOR_SIZE_MISMATCH2025년 02월 28일 19시 34분 35초에 업로드 된 글입니다.작성자: DE 군고구마반응형
This time, I wrote this article after experiencing a very big(?) problem. It was my first time experiencing a problem while using Databricks, and above all, I had a very hard time resolving it because there was no related information.
To conclude, it was a bug in Databricks. In the end, I opened a CASE and the backend engineer at the headquarters resolved the problem.
⚠️ Problem found
We have a task that does DELETE. But one day, the following ERROR occurred and the DELETE failed. The message is as follows:
[DELTA_DELETION_VECTOR_SIZE_MISMATCH]
Deletion vector integrity check failed. Encountered a size mismatch.First, I instinctively googled, tried various measures, and even asked the AI provided by Databricks about the problem. In the end, it was said to do REFRESH TABLE, and after executing it, I executed DELETE again, but it did not work due to the same problem.
🛠️ Symptoms
Here is a list of symptoms. Some were possible, some were not. Some were only partially possible.
⛔ DELETE impossibility
⚠️ SELECT impossibility
⛔ OPTIMIZE impossibility
✅ INSERT possibility
✅ vacuumpossibilityAnd there was also an ERROR message that said there was a problem reading the file, so I found the file, downloaded it, and read it directly, but there was no problem at all. In conclusion, it did not seem to be a problem with the file.
💡 Conclusion
And as for the ERROR message, it was a problem that could not be solved in conclusion. Databricks admitted it was a bug, and after fixing it, I confirmed that everything was working normally. In the end, after struggling for 3~4 days, I opened the CASE, and it said it was a bug right away, so I was a little... discouraged. So if there is no Google or documentation, don't worry and open the CASE. There was also an article that said there was a problem reading this file, so I looked for the file, downloaded it, and read it directly, but there was no problem at all. In conclusion, it didn't seem to be a file problem.
반응형'Databricks' 카테고리의 다른 글
[Databricks] targetFileSize 테이블 옵션 (1) 2025.04.30 [Databricks] S3에 있는 파일을 테이블 처럼 읽기 (0) 2025.04.25 [Databricks] DELTA_DELETION_VECTOR_SIZE_MISMATCH (0) 2025.02.27 [Databricks] 여러 폴더를 외부(external) 테이블로 만들기 (0) 2024.12.26 [Databricks] Optimize / VACUUM (4) 2024.12.02 다음글이 없습니다.이전글이 없습니다.댓글