Learn in our demo how to manually crop and split documents within a review queue in ABBYY Vantage.
Hello. Today I’m gonna walk you through how we set up manual review cropping on a document. And this happens in places where we actually get three separate documents on a given page or even within a given file. But this is an example of what you see on the screen. I have one page with three separate IDs on them, and really what we would need is those to be three separate documents. So in ABBYY Vantage, we have the ability to allow a human to tell us where on this page the documents start and stop so they can crop them. And then from there we can set up the rest of the extraction and classification processes automatically.
So this is very common, like what you’re gonna see in business. Somebody just gives us three documents, three in this case IDs on a given page. But we actually need those to be three separate documents for storage and automation downstream.
So what we’re gonna have here is what we refer to as a process skill. And today’s process is gonna be what you see here on the screen. We’re gonna bring a document in, we’re gonna send that to a cropping review screen. So that’s the very first thing that’s gonna happen is we’re gonna ask the human to crop this image to make sure that we’ve auto separated it. Now what that does is it will create one document with three separate IDs. So if you remember here, when we crop, what’s gonna happen is we’ll have three separate IDs, but all within a single document. So that actually creates three pages. That’s not quite what we need, but good enough for the user. So we don’t wanna burden the user with having to assemble the document manually. So we will do that automatically. We will assemble that through what we call our assemble activity, using our classification step. We will then extract the information from those IDs and then of course we’ll review the results together. But the idea is, is that a human’s gonna come into Vantage, crop that document for us, so we know where those IDs start and stop, and then we’ll automatically assemble them here.
So let me show you kind of this process. So we’re gonna go ahead and upload this ID PDF, and that’s going to go to our process skill.
Now this should happen pretty quick. What it’s gonna do is it’s gonna send us right into that cropping activity that you saw there on the workflow. So we would just hit our review button here. And what we’re gonna ask the end user is to look at this document and tell us where these IDs stop and start. So no big deal. We’re gonna crop this image here using our cropping tool. And all we need the end user to do is just tell us where these documents start and stop. So this looks like a good ID. This looks like a good ID. And lastly, this looks like a good ID as well. And we’re just gonna go ahead and apply the crop.
So this is what the software’s done now. So we have a document that came in and this document has three pages. Like I said, that’s not necessarily always ideal. Typically in downstream processes, we would actually want these to be three separate documents, not a document with three separate pages.
What we’re gonna do is once again, carry down the workflow. So we’re right here at the review and crop. Now we’re gonna tell the software to go ahead and assemble that for us, and then we’ll go ahead and extract. So let’s just kind of release this from our queue. We’ll save and close this. We don’t need to extract any data yet. And then we’ll go ahead and complete it.
Now this task will be completed. So what we’re gonna do is we’re gonna kind of see that here. The software’s gonna continue processing that image. And then what we will do is we will have a queue that shows three separate documents.
So this is now what we have. So we’ve cropped it. And now you can see here I have three separate documents, each of those that we’ve manually cropped here, but now we’ve classified them correctly as identity documents and we’ve extracted the data from them correctly. So we can kind of see here on these documents here. Not only do we know the document type, but obviously now that we know the document, we can use our intelligent document processing extraction technology to extract the critical details off of those documents. So then what happens from here on is really up to you as the citizen developer to take these documents over to data and perform interactions downstream.
But the critical part, once again, is knowing the workflow. Is we have this cropping mechanism that allows the end user to intervene and tell us where documents start and stop. And that’s all we’re asking that end user to do. And then the rest here, we will handle ourselves through our activities in the ABBYY Vantage Suite.
[Music- “Engineered to Perfection” performed by Peter Nickalls, used under license from Shutterstock.
Adobe, Acrobat, and the Adobe PDF logo are either registered trademarks or trademarks of Adobe in the United States and/or other countries.]