RSDK-7469: Add CaptureAllFromCamera() #3906

Rob1in · 2024-05-06T18:40:01Z

that way it seems that it just works for all built-in vision service in RDK.

EDIT: ~~Open question on the MimeType for the "resp.Image.Image"~~ ok now

github-actions · 2024-05-06T18:40:15Z

Warning your change may break code samples. If your change modifies any of the following functions please contact @viamrobotics/fleet-management. Thanks!

component	function
base	IsMoving
board	GPIOPinByName
camera	Properties
encoder	Properties
motor	IsMoving
sensor	Readings
servo	Position
arm	EndPosition
audio	MediaProperties
gantry	Lengths
gripper	IsMoving
input_controller	Controls
movement_sensor	LinearAcceleration
power_sensor	Power
pose_tracker	Poses
motion	GetPose
vision	ClassificationsFromCamera

bhaney

Nice that CaptureAllFromCamera is going to be finally be able to return all the things we need it too!

In addition to the changes requested below, please add tests for client, server, and vision service

bhaney · 2024-05-09T18:07:05Z

rimage/image_file.go

-			return nil, err
+			if mimeType == "" {
+				img, _, err = image.Decode(bytes.NewReader(imgBytes))
+				if err != nil {
+					return nil, err
+				}
+			} else {
+				return nil, err
+			}


I know we wrote this together, but actually I think what you can do is just put this in its own case statement, so that you don't have to check again if the mimeType is ""

so it would be

case "": img, err := DecodeJPEG(bytes.NewReader(imgBytes)) if err != nil { img, _, err = image.Decode(bytes.NewReader(imgBytes)) if err != nil { return nil, err } } return img, nil

bhaney · 2024-05-09T18:11:19Z

services/vision/client.go

+	returnImage bool,
+	returnDetections bool,
+	returnClassifications bool,
+	returnObject bool,


I would prefer that you create a struct that contains all of these bools together, rather than having to feed them in individually. It would also help with backwards compatability in case anything is ever added to the list

bhaney · 2024-05-09T18:11:53Z

services/vision/vision.go

+		returnImage bool,
+		returnClass bool,
+		returnDet bool,
+		returnObjPCD bool,


gather all of these bools into a struct, and pass the struct in as the argument. You can define this input within the viscapture package too

bhaney · 2024-05-09T18:12:46Z

services/vision/vision.go

+	returnImage bool,
+	returnClass bool,
+	returnDet bool,
+	returnObjPCD bool,


use a new struct for these bool inputs

bhaney · 2024-05-09T18:14:15Z

vision/viscapture/viscapture.go

+type VisCapture interface {
+	Image() image.Image
+	Detections() []objectdetection.Detection
+	Classifications() classification.Classifications
+	PointCloudObject() []*vision.Object
+}
+
+// NewVisCapture returns a VisCapture.
+func NewVisCapture(img image.Image,
+	dets []objectdetection.Detection,
+	class classification.Classifications,
+	obj []*vision.Object,
+) VisCapture {
+	return &capture{
+		image:           img,
+		detections:      dets,
+		classifications: class,
+		pcd:             obj,
+	}
+}


I'm curious why you chose to make an interface for VisCapture, rather than just making the struct Public. Was it in order so that the output could not be modified by a user? In that case, what you can do is make the struct public, but the member variables private, and then have the methods on it to retrieve the values from the member variables (as you have already done)

services/vision/server.go

bhaney · 2024-05-09T18:54:31Z

services/vision/server.go

+func imageToProto(ctx context.Context, img image.Image, mimeType string) (*v11.Image, error) {
+	imgBytes, err := rimage.EncodeImage(ctx, img, mimeType)
+	if err != nil {
+		return nil, err
+	}
+	return &v11.Image{Image: imgBytes}, nil
+}


Add checks for if a "" mimeType is passed in.

Check if its DepthMap or Gray16 (then encode as a ViamRawDepth Image)

Check if it is a LazyEncodedImage (then use the mimeType from the lazy encoded Image)

If it's none of those, you can default to a JPEG encoding

bhaney · 2024-05-09T18:54:45Z

services/vision/server.go

+	if err != nil {
+		return nil, err
+	}
+	return &v11.Image{Image: imgBytes}, nil


Also, Image has more fields than "Image", you should also store the"Format" that you ended up encoding it as. You also have the original camera name -- put that in the "SourceName" field

bhaney · 2024-05-09T18:56:34Z

services/vision/client.go

+		return nil, err
+	}
+
+	img, err := rimage.DecodeImage(ctx, resp.Image.Image, "")


The response Image gives you the "Format" as well. So you don't actually have to leave this blank -- you can know what the image is encoded as and decode it appropriately

bhaney · 2024-05-09T18:58:14Z

services/vision/client.go

+		return nil, err
+	}
+
+	capt := viscapture.NewVisCapture(img, dets, protoToClas(resp.Classifications), objPCD)


While protoToClas doesn't return an error, it was confusing at first because it seemed like you didn't return classifications. Go ahead and put this conversion on a line after protosToDets rather than doing the conversion in the New function.

Rob1in · 2024-05-15T17:45:50Z

services/vision/vision.go

+	defer release()
+	var detections []objectdetection.Detection
+	if opt.ReturnDetections {
+		if !vm.properties.DetectionSupported {


@bhaney I changed those tests to use the new vm.properites. Is that ok?

Rob1in · 2024-05-15T17:47:25Z

vision/classification/classifier.go

 	sort.Slice(cc, func(i, j int) bool { return cc[i].Score() > cc[j].Score() })
-	return cc[0:n], nil


the change to return all classifications when n=0

Rob1in · 2024-05-15T17:51:15Z

services/vision/server.go

+	return &camerapb.Image{
+		Image:      imgBytes,
+		Format:     format,
+		SourceName: cameraName,


it seems that SourceName gets lost when the proto is decoded in the image.Image....

luckily, SourceName is only really important for GetImages -- so I think this is fine

bhaney · 2024-05-15T21:23:30Z

services/vision/server.go

+	}, nil
+}
+
+func encodeUnknownType(ctx context.Context, img image.Image, defaultMime string) ([]byte, string, error) {


nice function!

bhaney

Nice work! LGTM!

viambot added the safe to test This pull request is marked safe to test from a trusted zone label May 6, 2024

viambot added safe to test This pull request is marked safe to test from a trusted zone and removed safe to test This pull request is marked safe to test from a trusted zone labels May 9, 2024

bhaney requested changes May 9, 2024

View reviewed changes

Rob1in added 9 commits May 9, 2024 15:00

first commit

57b27ee

cleaner

08a8a40

clean and debug statement

64b6e5a

typo in span

b132ecb

remove lazy encoded image check:

187bdae

linting

17f287b

better comments

739dd52

review

ac1dffa

merge main

8169c4b

Rob1in force-pushed the RSDK-7469 branch from 0874ddd to 8169c4b Compare May 13, 2024 20:01

viambot added safe to test This pull request is marked safe to test from a trusted zone and removed safe to test This pull request is marked safe to test from a trusted zone labels May 13, 2024

some tests

ec2c7bc

viambot added safe to test This pull request is marked safe to test from a trusted zone and removed safe to test This pull request is marked safe to test from a trusted zone labels May 15, 2024

merge main

03aa815

viambot removed the safe to test This pull request is marked safe to test from a trusted zone label May 15, 2024

viambot added the safe to test This pull request is marked safe to test from a trusted zone label May 15, 2024

Rob1in requested a review from bhaney May 15, 2024 17:43

Rob1in marked this pull request as ready for review May 15, 2024 17:43

lint

6ec2d92

viambot added safe to test This pull request is marked safe to test from a trusted zone and removed safe to test This pull request is marked safe to test from a trusted zone labels May 15, 2024

Rob1in commented May 15, 2024

View reviewed changes

fix test

b0d1fe2

viambot added safe to test This pull request is marked safe to test from a trusted zone and removed safe to test This pull request is marked safe to test from a trusted zone labels May 15, 2024

bhaney reviewed May 15, 2024

View reviewed changes

bhaney approved these changes May 15, 2024

View reviewed changes

Rob1in merged commit 18d6caf into viamrobotics:main May 16, 2024
17 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RSDK-7469: Add CaptureAllFromCamera() #3906

RSDK-7469: Add CaptureAllFromCamera() #3906

Rob1in commented May 6, 2024 •

edited

Loading

github-actions bot commented May 6, 2024

bhaney left a comment

bhaney May 9, 2024

bhaney May 9, 2024

bhaney May 9, 2024

bhaney May 9, 2024

bhaney May 9, 2024

bhaney May 9, 2024

bhaney May 9, 2024

bhaney May 9, 2024

bhaney May 9, 2024

Rob1in May 15, 2024

bhaney May 15, 2024

Rob1in May 15, 2024

Rob1in May 15, 2024

bhaney May 15, 2024

bhaney May 15, 2024

bhaney left a comment

		sort.Slice(cc, func(i, j int) bool { return cc[i].Score() > cc[j].Score() })
		return cc[0:n], nil

RSDK-7469: Add CaptureAllFromCamera() #3906

RSDK-7469: Add CaptureAllFromCamera() #3906

Conversation

Rob1in commented May 6, 2024 • edited Loading

github-actions bot commented May 6, 2024

bhaney left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

bhaney left a comment

Choose a reason for hiding this comment

Rob1in commented May 6, 2024 •

edited

Loading